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ABSTRACT 


This thesis represents the results of an investigation 
into the implementaticn of a "“one-pass compiler" for a 
modified version of the programming language ALGOL 68. 
Modifications made to the original language consist of 
additions and alterations that are considered to enhance the 
language from bcth a user's and an implementor's point of 
view. These additions and alterations are discussed, with 
examples, anda syntax tor the modified version of ALGOL 68 
is presented. Implementation dependencies and restrictions 
imposed to facilitate one-pass implementation are discussed, 
and techniques for some of the tasks which the ccmpiler must 
perform are described. Special emphasis is flaced on those 
CaSkSeePeCculiarwato. a) buts note a "multi-pass" 


compiler. 
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CHAPTERS. 


INTRODUCTICN 


1.1 Introduction 


The Repert on the Algorithmic Language ALGCL 68 [1] 
(hereinafter referred to as the Report) defines a new 
programming language intended to be the official successor 
to ALGCL 60 [2]. In ccmparison with ALGOL 60, ALGCL 68 is 
mere powerful and has wider applicability. New data types 
("modes") as well as new language constructions and 
operations are introduced. With this greater complexity, a 
more complicated descriptive technique is used. The 
deseription cof “ALGOL® 68° %consists Tots 7a syntactic *@part, 
formulated using a "two-level" syntax, and a semantic and 


pragmatic part, formulated in the English language. 


Since the publication of the Report, many proposals 
ccncerning ALGOL 68 have been made. It 1S nee eis. 
objective of this thesis to comment on ALGOL 68 and some of 
the proposals for additions and alterations. The second 
objective is to describe a proposed "“one-pass compiler" for 


a large sutset of ALGOL 68. CommentS on additions and 
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alterations are made in the light of experience gained while 


designing the one-fass iaoplementation. 


The thesis presupposes considerable familiarity with 
ALGCL 68. ‘The first occurrence of any technical term or 
notion from the Report will appear within quotation marks, 
and where necessary, will be followed by a reference to the 
particular section of the Report in which the definition may 
be found. Short descriptions and examples are given 
frequently to illustrate ands char yey. chcaonceptsvebeing 


discussed. 


Chapter II of this thesis is concerned with commenting 
on proposed additions and alterations to the language and 
developing a mcdified syntax for ALGOL 68. This syntax 
appears as Appendix A. Chapter III investigates general 
aspects cf the ccmpiler and discusses restrictions imposed 
to facilitate cne-fpass implementation. A description of some 
of the modules of which the compiler is comprised appears in 
Chapter IV, along with discussion of some of the major 
implementation difficulties. Finally Chapter V_ presents 


conclusions gained from the investigation. 


Unless ctherwise noted, terminology and notation used 
is the same as that of the Report. References to specific 


parts of the Repert appear between brackets and are given as 
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R followed by the section numbers and then, if necessary, a 
letter indicating the section or paragraph to which 


reference is keing made. For example, [R1.3] or [R3.0.1.b)]. 


ALGOL 68 is defined in three stages; the "strict 
language", the "extended language", and the "repesentation 
language" [R1.1]. The syntax of the strict language is a set 
of “production rules" for “notions". Sets of rules’ for the 
language are ccmkined into single rules with the aid of 
"metanotions", the production rules for which are given by 
the "metaproducticn rules" {R1.2]. A particular production 
rule is oktained from a rule of the Report by replacing all 
cccurrences cf a metanction contained in the rule with the 
same terminal production of the particular metanotion. This 
is repeated for all metanotions throughout the rule. For 
example, 

‘actual real field letter t digit one declarator: virtual 
real declarer, real field letter t digit one selector.' 

is a particular production rule obtained from 

"‘VICTAL NCNSTCWED field TAG declarator: virtual NCNSTOWED 
declarer, NCNSICWED field TAG selector.' 

VICTAL, NCNSICWED, and TAG are metanotions and have _ been 

replaced throughout by the terminal productions "actual", 


"real", and “letter t digit one", respectively. 


Note that when reading rules of syntax ":" may be read 


aa atmay recat, icllowedehby a", and "3" as "or a. 
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Detailed consideration will be given to the extended 
language and the representation language in Chapter II. The 
representation language used for examples given in this 


thesis is similar to that used in the Report. 
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CHAPTER II 


CCNSILCERATION OF MODIFICATIONS TO ALGCI 68 


Zeal Introduction 


The purpose of this Chapter is to discuss and comment 
on some proposals for additions and alterations to ALGOL 68. 
A syntax incorporating those additions and alterations 
thought to enhance the language appears as Appendix A of 


this thesis. 


The following quotation is extracted from a formal 
resolution made at a recent meeting of Working Group 2.1 
(ALGCL) 1 in Novosikirsk. 


Moree WG2 ee WOCKING es GLOUp 2.) |e ineclose 
ccllakcration with many implementors and 
retential users - is considering several 
Eroposals for correction, revision and also 
expansion of the language defined. These 
Froposals may be classified in order of 
increasing degree cf departure frem the 
present language as follows: 


1. Corrections and clarifications to the 
text of the Report without change to the 
bedy of the language, but with correction of 
’Working Group 2.1 4(ALGOL) ® directed by Technical 
Committee 2 (IC2) of the International Federation for 
Information Processing (IFIP), commissioned and guided the 
writing of the Report on ALGCL 68. 
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those things which were obviously in error. 

2. Changes of a notational nature within the 

existing conceptual framework of the 

language, in order to ease implementation 

and improve clarity. 

3. Generalization of existing concepts and 

introducticn of some new ones, so as to 

increase the power of the language and cpen 

possitilities for future enhancements. 

u“. New ideas, —not» ripe for inclusion in 

ALGCL 68. ———— [oa 

These categories are referred to during discussion of 

certain propcsals within this Chapter. No proposals clearly 


belonging to category four have been included in the 


modified syntax. 


During the course of this investigation, many of the 
propesals for additions and alterations to ALGCL 68 were 
studied. While some of these proposals were adopted because 
they were considered to improve the language, cthers were 
rejected because they belonged to category four, for example 
[4-LQ215, 5-1Q201], or because they were thought to be 
unsuitable, from either a user's or an implementor's point 
of view, for incorporation into the language. Proposals 
considered within this thesis are referred as near to their 
source as possible from available documents and literature. 
Proposals not referenced may he considered to arise from 
tis investigation (though perhafs simultaneously with 


similar proposals arising elsewhere). 
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Attached to most sections of syntax in the Report are 
semantic descriptions. With the syntactic modifications that 
are inccrporated, corresponding modifications to the 
semantics are usually necessary. There are also some 
semantic changes belonging to category one that should be 
made, tor example [6-proposal 8]. However, except for occa- 
Sional mention cf how the semantics might be modified, these 
semantic changes will not be further discussed in this 
thesis. Also proposals ccncerning "transput" [R5.5.1.aa] and 
associated matters (R10.5] are not considered in this 


investigation. 


One of the major criticisms directed at the Report 
ccncerns the manner of presentation. In the preparation of 
this thesis the Report has been used as the defining 
document while the companion volume [7] has teen used as a 
primer. In this manner, anticipation of the teaning of some 
sections of the Report was achieved and the task of 


comprehension relieved. 


The first three chapters of the kKeport are 
SoeLCauctoLy chapters s= Chapter. GO .1si thes. "introduction", 
Chapter 1 describes the “Language and Metalanguage", and 
Chapter 2 defines relaticnships between "The Computer and 


the Program". 


The syntax of the wmetalanguage has been modified 


scmewhat as otter syntactic changes have been introduced. In 
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Ghaptept20a"' particular=program" [R2.1.4d] becomes a "serial- 
clause" [{R6.1] enclosed between begin and end or hetween 


parentheses as suggested in [8-proposal 9]. 


The topics discussed in the remainder of this Chapter 
are presented in the same order and with the same titles as 
the Chapters of the Report, ccmmencing at Chapter 3. Note 
that the syntax changes associated with some proposals may 
extend over Charters of the Report other than the one 


currently reing discussed. 


Chapter 3 of the Report introduces the concept of a 
representation language, strongly suggesting rather than 
explicitly prescribing the refpresentation(s) cf symbols. The 
representation language LOG the proposed one-=pass 
implementation to be discussed in this thesis is given in 


Appendix C. 


Several modifications have been made to the syntax of 
Chapter 3 and to representation choices. It should be noted 
that "indications" [R4.2] have been kept separate from non— 
redefinable representations and that there is only one 
representation per indication. In the present keport for 
example, not and -~ are given as representations of the "not- 


symbol", This means that if the following "“operation- 
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declaration" [k7.5] is made; 
Cp not = (bocl b) bool: true 
then the meaning of the “operator" [R4.3] +> is altered as 
well and this could be somewhat ccnfusing to the user. Under 
the profosed nodification,2tnotitand @-fewiliveanitaally 
Hrossesso( R2i2s 26dq/theasanes"routine"i PR2.2.2.f£, R2e2.3.4] 
but after the akove declaration, — will remain as declared 
a the "standard-prelude" [R2.7.b, R10] (the declaration of 
Standard “"mode-identifiers" [R4.1], indications, operators, 
and "frocedures" [R6.0.1.f£]). This is easier to implement as 
no links need te kept between the representations - and not. 
For ease of writing operaticn-declarations in which the same 
routine is possessed by different "adic-indications" 
Ruel. led |, a proposal for “operator—bases" [o-proposal 30D] 
might be adopted though this has not been included in the 
modified syntax. Then the following would be fossible; 
Op not = (bool b) bool: (b|falseltrue) ; 


Obes sm (DOL): 


Fuli length representations of symbols that were 
akbreviated within the Report have been intrcduced. For 
example, procedure and froc are both representations of the 
"procedure-symbol". This is done at the request of some 


potential users posandticthers jJr5*Ineether*case® "of* the 


"pnriority-symbcl", a shortened representation is added. 


There are other representations that have been added 
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in accordance with current proposals [10, 4-1¢207A] and with 
proposals to ke discussed later in this Chapter. Since these 
changes involve representations they are all of category 
two. The "lexical analyzer" to be described in Chapter IV 
will be constructed so that it will be a relatively simple 


process to alter and/or add representations. 


An okjection to this representation language is the 
chosen tet ropping" convention (see Appendix C). Many 
potential users have expressed annoyance at having to type 
an apostrophe tefore certain sequences of letters and 
digits. Thus in the design of the compiler, care has _ been 
taken not to make special use of the apostrophe so that a 
"reserved-wcrd" representaticn language may be investigated. 
The reserved words would have blanks or special characters 
(characters other than letters or digits) as delimiters. 
Currently, there are five representations that might be 
confused with standard "identifiers" or "TAGs" [R4.1] ina 
reserved-word implementation. Alle Vand aout! possess 
routines defined in the standard-prelude while in and out 
are respective representations of the "in-symbol" and the 
"out~symbcl". "rel and tint are "field-selectors" 
Dive lat. lyadetined inate standard-prelude while re and in 
are respective representations of the "real-part-of-symbol" 
and the “imaginary-part-of-symbol". Finally, “letter e 
letter x letter i letter t" [R2.1.e] might cause confusion 


with exit, a representaticn of the "“completion-symbol". 
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These cases would have to be resolved if a reserved-word 


implementation was being considered. 


Chapter 4 of the Report introduces identifiers, 
indications, and operators, describing methods for their 
identification and context conditions which a program must 
Satisiy to ke a “proper” program. "Applied occurrences" of 
identifiers (indications and operators) identify "defining 
eoccurrences" of identifiers (indications and operators) 
found Ly methods described in this Chapter cf the Report. 
The identification processes make use of "ranges" [ k4.1.1.e] 
which have much in common with what in some other 
programming languages are known as "blocks". In particular a 
range defines the "scope" [R2.2.4.2] of the "names" 


PROG2.2.1, R2.2,.289] Created by declarations within at. 


Because it would reguire a special check to test 
whether certain ‘“mode-indications" [R4.2.1.b] could be 
redefined or not, it was decided to allow the primitive 
mode-indicaticns, int, real, kool, char, and format to he 


redefined. Thus these five mentioned indications become 


“"mode-standards" [R4.2.1.c]. 


The "“short-symbol" is introduced and syntax changes 


are made so that as well as being able to have "long modes" 
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[R2.2.4.1], for example long real, it is possible to have 
HShorteg modes", for exanpleigshopttint~[{dua-proposaleun-eDhis 
dives an implementor the choice of internal representations 
for the mode-standards, int, real, bits, and Lytes, while 
not restricting him from accessing shorter lengths should he 
not initially select the shortest length of internal 
representation on the physical machine. Note that it is now 
allowable to have any number of short-symbols or any number 


of "icng-symkcols" commencing an adic-indication. fThis 


unifies the implementation approach to adic-indications. 


Scme changes are incorforated for "dyadic-indications" 
[htaget.d.| and “Operator identification. Consider the 


fcllowing example; 


The fat st occurrence “of “+ “is" ean indication-defining 
occurrence. The second occurrence is an oOperator-defining 
and an indicaticn-applied occurrence. The third is an 
Operator-applied and an indication-applied occurrence. At 
present the operator-defining occurrence, as an indication- 
applied occurrence, must identify an indication-defining 
cccurrence. It is proposed that in operation-declarations, 
no priority be associated with the operator-defining 
occurrences. Friorities are associated only with operator- 


applied cccurrences (as indication-applied occurrences). 
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Thus the second + above would be an operator-defining 
eccurrence "but ‘not an indication-applied occurrence. 
Necessary syntax changes have been incorporated and the 
ccrresponding alterations to the semantics, and to the 
process of operator identification, are not difficult. With 
the proposed changes, the first occurrence of + above is a 
terminal production of “priority-FIVE-dyadic-indication", 
the second a terminal production of "PRAM-dyadic-indication" 
and tke third a terminal production of "PRAM-priority-FIVE- 
dyadic-indication". Examples such as the following are now 


permitted; 


egin op min = (int a,b) int: (a<blalkr); 
1#Sccndition then priority win = 5; 


print (3+2min4) 

print (3+2min4) fi; , 

where "condition" is of "boolean" mode. In the present 
language, this wculd viclate a context condition [RS.45 12c] 
and this context condition, because of the incorforation of 
the syntax changes, has been dropped. If "condition" in the 
above example is true, "4" will be Printed, ity talse eso! 
will te printed. This proposal originated as a by-product of 
another proposal [4-LQ215]. if included independently, then 
a change to the definition of "protection" [R6.0.2.d] is 


necessary. Frotection is the mechanism which allows 
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unhampered definitions of identifiers, indications and 
Operators within ranges and permits a meaningful "call" 
{R8.6.2] within a range, of a procedure declared outside it. 
This is because in the semantics describing a call the 
actual procedure body is copied in place of the call. This 
is not in fact how calls would ever be implemented and it 
would seem fetter to eliminate the concept cf protection, 
replacing it with semantics more closely parallielling 
inplementation technigues. Nevertheless with the above 
scheme it is possible to redefine protection to allow the 
copying cf frocedures but there will be cases where, after 
protecticn, the identification process will still have to be 
Carried out for dyadic-indications in order to ascertain 


their pricrities. 


In ALGCL 68, operators are identified not only by 
their indications but also by the modes of their “operands" 
[R8.4.1.c]. Currently, a frogrammer is not permitted to 
define two operators whose operands possess modes that might 
cause confusion. Such operands are said to have “loosely 
related" modes [R84.4.2.c]. It is proposed to replace the 
loosely related condition with the condition that a program 
is not proper if there exists an operator in a “formula" 
[R8.4] which identifies two operator-defining occurrences 
[4-LQ205A]. This resolves some ambiguities [12] and is not 


as restrictive as the current language. Under this new 
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proposal, operators with operands whose modes are loosely 
related will ke permitted, unless an applied cccurrence of 
one of the cperators actually does lead to ambiguity. An 
increase in overhead may be expected in cases where there 
are several operators with the same indication as the 
applied occurrence declared within the same "reach" 
[R4.4.2.a] (a range excluding all ranges ccntained within 
it). All defining cccurrences in this reach will have to be 
checked; the search may not terminate when the first 
applicable operator is found. However, when checking 
operators whose defining cccurrences are in the standard- 
prelude, the search may terminate on finding the first 
applicable operator, because applied ocurrences of operators 
declared within the standard-prelude are known not to lead 
to ambiguities. With the incorporation of this double 
identification context conditicn, there may he programs 
written in a subset of ALGOL 68 that are not proper programs 
in revised ALGCL 68. This is because ambiguities may arise 
due to the, effect of certain features of ALGCL 68 not 
included in the subset. With the definition of "sublanguage" 
as in the Report {R2.3.c] this wculd mean that the subset 


was not a suklanguage. 


Finally an this Chapter, another condition for 
"shielding" [F4.4.4.a] (avoiding the description of modes 
which require an infinite amount of storage) is added in 


that a mode-indication will be shielded if it is contained 
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ii ttaneitdctiak=boundig offhepe "declarer" 4 i[eR7 
allow, as pointed out in [8-froposal 5], the following; 


gmteinswstd's if ) weal 2a; 


— — * See eS 
ae 


a{jj:=heap m fi; 6) ] real; . 
This is equivalent to the following example; 
int«i:=10;°(1:10] ref [ ) real a; 


Erocetbhc= (int cjjavoid: tafe} j:=heap tn; 


f=! 
0 
{au 
lo 
1S 
fT) 


Vs (tess; i2f iz0ethen b(i) £1; 6) ] real; - 
This example is currently proper until the elaboration of 
the call when, due to textual replacement, context 
conditions beccme violated. No extra implementation probiems 
are created, as the elaboration of an "actual-declarer" will 
involve the execution of a corresponding subroutine and this 


subroutine may ke recursive. 


2.4 Denotations 


Known in other languages as "literals" or "constants", 
denotations are terminal pfroducticns of notions whose value 
is independent of the "elaboration" [R6.0.2] of the progran. 
As published, the Report includes an unsatisfactory method 
cf cconstructing "kits-denotations" [R5.2]. Essentially a 
bits-denotation is defined as being a sequence of "flip- 


symbols" and/or "flop-symbols". Actual representations of 
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bits-denotations using these Symbols are unwieldy, for 
exaupie;, » 1t0' té4ntormitio1icasa proposal for Ucadix= 
denotations" [13-proposal 7], allowing more general bits- 
denotaticns with radices 2, 4, 8 or 16, is adopted; for 
Sxanpie, #2251000 ,ce4ni23apalsc7605e, Aér5a9fuleathe Letters ‘a 
through f are used for the hexadecimal "digits" 10 through 
15. With the acceptance of this proposal the flip-symbol and 
the flop-symkol are ne lenger required and minor changes 
must be made te fcrmatted transput with the introduction of 


PELTtS=patterns"upr5s5 95 


In ALGOI 68, routines and formats are recognized to be 
data objects and hence there exist "routine-denotations" 
pease is oand —"forpat—denotaticons" @liho.o |. elne syntax Lor 
routine-denotations is altered according tc froposals to be 


discussed in sections six and seven of this Chapter. 


With the introduction cf short modes aS mentioned in 
the previous section, "short-integral-denotations", "short- 
real-denotations", and "short-bits-denotations" are per- 


mitted. 


— Se See eS 


Phrases are "declaraticns" or "clauses". These are the 
bricks which make up a serial-clause and thus a particular- 
program. Three types of clause defined in Chapter 6 of the 


Report are the "“collateral-clause" [R6.2], the "closed- 
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clause" [R6.3], and the "conditional-clause" [R6.4]. "Case- 
clauses" and "conformity- case-clauses" are introduced in 
the Report by means of "extensions" [R9.4]. However in the 
Syntax presented in Appendix A these case-clauses are 
unified with the conditicnal-clause, as formulated in [14]. 
All extensions concerning these clauses are included in the 
Syntax as is the proposal to require the matching of symbols 
in ccnditicnalx, case-, and conformity-case-clauses. For 
example, in a particular conditional-clause, it is 
permissikle tc use symbols from if, thef, then, else, elsf, 
ieee Ores T LOM (ply) | ead ©), sbut nOtm@eLrome Doth. This 
required matching has been applied to other ccnstructions in 
the syntax, for example, the "quote-symbols" delimiting 
"Character-denotations" [R5.1.4] and "string-denotations" 
{Ro.3)], the “suk-symbol™ and "“bus=symbol" of "slices" 


fe. 6.4, etc. 


In ALGCL 68 it is fossible to have a mode "united 


from" [R4.4.3.a] other modes. For example, in 


m is united from the modes int and real. It is the purpose 
of "ccnformity-relations" [8.3.2] to enable the programmer 
to find out the current mode of an "instance" [R2.2.1] of a 
"value" [R2.2.3] if the context permits this mode to he one 
of a number of given modes. With the introduction of the 


conformity-case-clause, which performs the same PUnCCLON. ve 
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would seem that the ccnformity-relation is no longer 
necessary and it has been omitted from the syntax of 
Appendix A. However, perhaps after more experience of 
programming in ALGOL 68, it may be found that a need does 


exist in the language for ccnfcrmity-relations. 


The form of the proposed conformity-case-cliause is 


illustrated Ey the following example; 


Note that the following example is permitted 


casec unionintreal in (bool b): skip, 
(char c): skip 
OUteskip, Cesac; 


in which case the “out-clause" is always taken (exactly as 


"unionintboolreal" and 
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“unionintreal" are identifiers with obvious modes. The 
Pant) eet sivnboolt b) yeandem(charsc) x" ine thevabevet examples 
are called "specifications". A required context condition 
for the introduction of conformity-case-clauses is that no 
EWoespeciftications 10 tne Nconrtormity-units! of se tie iii 
Clause" of a conformity-case-clause may specify "related" 


{R4.4.3.k] mcdes. For example, in 


if "unicnintboolreal" possesses a value of integral mode 
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then it is not clear which alternative to take. The above 
context condition rules cut situations such as this because 
upbion({int, real) and int are related modes. Note the 
Optional identifier in the specifications. If the instance 
cf the value is to be used within the alternative, then an 


identifier, which is then made to possess that instance, may 


be included; if not, it may be omitted. 


No change is made to the ranges of conditiconal- or 
case-clauses {15-proposal 18]. Thus in 
the "x" adn the in-clause (the "then-clause") will not 
adentify the "x" declared in the "conditional". The 
alternative is tc write 

begin string x; 
The arguments for the proposed ranges do not seem strong 
enough tc warrant changing the present Situation. With the 
intrcduction of specifications within conformity-case- 
clauses, many of the examples using the proposed ranges are 


accounted for. 


In Chapter 6 of the Report, "row-displays" and 
"structure-displays" (R6.0.1] are introduced. For example, 
in 

{1:3] real x,y; 


x3= (1,2, 3) 3 
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Y2=XOCS PSAs , 425); 

CUS 3) evandt "(Qsethpassyp omave row=displays ‘and ('% "cand 
“y" are cf mode "reference-to-row-of-real". In 

Struct (int 1, real x, char c) 2; 

Boal PFAES Pp" SN") = 
"(1,2.3,"c")" is a “structure-display" and "z" is of mode 
"reference-to-structured-with-integral-field-letter-i-and- 
real-field-letter-x-and-character-field-letter-c". 
At present it is ypermitted to have row-displays but not 
structure-displays as operands within formulas. With the new 
context condition for the identification of operators it is 


possiktle to allow structure-displays as operands because any 


ambiguities cccurring will be found at the time of operator 


identification. However, the time taken to identify 
operators may ke substantially increased if this is 
permitted. The syntactic changes required to allow 


structure-displays as operands are introduced in the syntax 
of Appendix A. Because of the practical advantages of having 
structure-displays as operands, it is planned to compare 
efficiency of operator identification between the cases when 
this is permitted and when it is not. Further discussion on 


displays appears in section seven of this Chapter. 


2.6 Unitary Declarations 


— — — — —— — Se —— 


Unitary declarations are of four types. "Mode- 
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declarations" [Rleza provide the indication-defining 
cccurrences of mode-indications, "priority-declarations" 
PRicay Erovide the indication-defining occurrences of 
dyadic-indications, "“identity-declarations" [R7.4] provide 
the defining cccurrences of mode-identifiers, and operation- 
declarations previde the operator-defining occurrences of 


Operators. 


When specifying modes in ALGOL 68, declarers are used. 
Teclarers are of three types: "actual", "formal", and 
"virtual" [R7.1j, depending on the kind of "bounds" which 
are permitted. An actual-declarer describes an instance of a 
value of the corresponding mode while a virtual-declarer 
describes the corresponding mode only. fFcrmal-declarers 
currently have the greatest freedom and can ke used to check 
the ccmpatibility of bounds between formal-declarers in 
"formal-parameters" [R5.4.1.€ ] and actual-declarers in 
“actual-parameters" (R7.4.1.b]. However, apart from 
specifying the "flexibility" of the bounds in "multiple 
values" [R2.2.3] (spoken of as "arrays" in some other 
languages) possessed by the corresponding actual-parameters, 
formal-declarers do not yield very much. Their use for 
checking ccmpatibility of bounds is limited and a burden to 
the implementor. Cperators lwb and upb are defined in the 
standard-prelude and these may be used to check bounds, if 
desired. It is therefore proposed to have only actual- and 


virtual-declarers {5-proposal Ldils However, the 
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Specification cf flexibility remains to be considered. It is 
Proposed to specify flexibility in the mode, for example 
"flexible-row-of-real" specified py flex [] real, thus 
making flexibility a property cf the whole multiple value, 
not of just one dimension. The either case (bounds which may 
be "fixed" or flexible), if needed, may be represented by 
union({ ] real, flex [ ] real). The modifications required for 
“Tlexyable-rcwsof-"tetmodess) are, kadvfficult to formulate and 
represent scme guite extensive changes to the Report (20 
would ke a category three proposal). Attempts at a 
formulaticn allowing comparable use of flexikility to that 
described in the Report have not been successful and formal- 
declarers remain in the syntax of Appendix A. However, their 
function has been reduced to specifying the flexibility of 
kcunds and apart from this they are the same as virtual- 
declarers. This proposal, together with another to forbid 
"go-on-symbols" in actual- and fcrmal-parameter-packs [16] 
will prove useful in the "syntax analyzer" to ke described 
in Chapter IV. These go-cn-symbols were included to allow 
"side effects" as in the following example; 
proc p = ({1:} real a; {1:upb a] real b) real: skip; - 

However, they ccmplicate the calling mechanism and if 


formal-declarers are modified as proposed, then there is no 


use for the go-on-symbol in formal-parameter-packs. 


To improve "orthogonality" [R0.1.2], a new class of 
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nodes, @ifomasexauplene [jek] izealpalis nintreduced sito) the 
language [(13-frogosal 5]. The declarer [,] real specifies 
“row-crow-of-real" while [] [] real specifies "“row-of-row-of- 
Pealligthus ®2it. 2is ] nowenpossibleiitoesiaveinataivector tat 
vectors", for example, {] string. Given the declaration, 

Eli npismie apis’) ceabax 
Stee LS “Yossiblesgtta Vepeams of titzie. 2 5 it ore "xP, 9 "or 


Maeva jpkitp butcnotec£ o'xfii f7,kgh. 


To improve clarity for both the syntax analyzer and 
the user, the "void-symbol" is introduced for "virtual-void- 
declarers" (R7ele tat ö 29.5 (Fore scexaupile,r ea 
routine-denotation of a routine which has an integral-mode 
farameter and which dces not return a result may ke written; 

(int a) void: skip. 
void, however, does not become a primitive mode such as a0t 
or real (ncw wode-standards). It may be that the syntax of 
the Report is simplified by the proposal to make void a mode 
[13-proposal 4}, but the concept of a "“void-mode" is 
somewhat confusing. In the Report, void has all the 
properties of an empty class of values and thus there would 
be no instance of a value of the mode void. It is not 
logical to declare 
void empty = skip 
as ‘“empty" would then possess a value which is a member of 
the empty class of values, and this is contradictory. Since 


the class of modes denoted by void would be a subset of all 
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other classes cf modes then it would not ke logical to 
distinguish the modes union(int, real) and union(int, real, 
void). If void is considered to be a structure with no 
"fields" [R2.2.2.k] then there will exist instances of the 
mode void and these will occufy zero storage. Implementation 
proklems then arise, for example, consider 

[1:2] void a;°a[1}:=:a[2];.. 
Normally the space occupied ty an element of a multiple 
Vaiuciwill keseqtaletovthes"'stridel ppR2e2 53 '3eb} "ofoits ilast 
dimensicn. In crder to be consistent with this, the "row-of- 
void", whose elements occufy zero storage, should have a 
stridencf«zero.ofhus¢a[ 1] and‘a[}2]-willtrefer to: the* same 
address and the value cf the “identity-relation" [R8.3.3] 
apbi cseafs2 Jouwil b be true, unlike similar language 
constructs. It would seem that void is only "virtually" a 


mode and thus it remains a virtual-void-declarer. 


2.7/7 Unitary Clauses 


ee See ae — — — — Se 


Unitary-clauses (often abbreviated as "units") are the 
entities in the language which actually get things done. 
Unitary-clauses are either "coercends" [F8.2] or closed-, 
collateral-, or case-clauses. In the syntax cf Appendix A 
there are five kinds of coercends. These are routine- 
deénotations, "confrontations" [R8.3], formulas, "cohesions" 


[R8.5}, and "kases" [8.6]. Each of these coercends may have 
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any one; -of) the eight basic "“coercions" applied to it 
depending on its centext, or Syntactic position. These 
ccercions are “dereferencing" [R8.2.1)}, "“deproceduring" 
{R8.2.2], “proceduring" (Res 2.545 "uniting" PRBs 2, 
"rowing" [R8.2.5], "widening" PksaceO lyme itpping | Reseed, 
and "voiding" [R&.2.8]. In the present Report there are five 
Syntactic positions: "strong" positions, where all coercions 
may be applied; "firm" positions, where the first four 
coercions listed above may be applied; "weak" positions, 
where the first twce may ke applied; "soft" positions, where 
Only deproceduring way be applied; and "EMPTY" positions, 
where no ccercions may be apflied. With the proposal to 
abolish ccnfcrmity-relations, the only position in which 
coercions could not be applied (the right hand side of a 
conformity-relation) disappears and when ccnfcrmity-case- 
clauses are introduced a new "meek" position replaces this 
"EMEFTY" position. The meek position is essentially the same 
as weak except that it is possible to completely dereference 


the coercend. 


Several proposals concerning coercion and belonging to 
category one have been incorporated into the syntax of 
Appendix A. These include the modifying of deproceduring as 
proposed in {17] to resolve an ambiguity; the allowing of 

{] proc int pp = (i-j)i Pp; 


in which "pp" is voided [18-proposal c]; and the hipping of 


"vacuums" [K&.2.€.1.b] to resolve an ambiguity, as proposed 
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in [8-preposal 1). 


Por clarity and ease of recognition, vacuums are now 
represented Ey an “open-symbol" followed by a "close- 
Symbol". Thus vacuums appear as empty row-displays. Further, 
if the symbol row were required to precede row+displays, for 
example row(1,2,3), then row-displays of one element could 
re introduced. This would mean that a vacuum could te COW (), 
that the rowing coercion could be eliminated, and that firm 
structure-displays cculd be permitted without affecting 
Operator identification to as great a degree as described in 
section five of this Chapter. Correspondingly if array were 
required to precede declarers specifying "row-of-" modes 
then reccgnition of these would be simplified (see Chapter 
Iv). At the same time, orthogonality would be improved if 
structures of zero or one fields and unions of zero or one 
modes were permitted. The first of these involves problems 
With a "structuring" coercion unless the symbol struct is 
required kefore structure-displays. If unions of zero or one 
mcdes were permitted then, perhaps union() would be 
eguivalent to void, and union(a) would be equivalent to a. 


These suggestions have not been incorporated into the syntax 


cf Appendix A. 


With the proposal for “row-of-row-of-" modes, another 
type cf rowing coercion is introduced. Rowing may now add 


another "“row-" to a mode already beginning with "row", 
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inserting a new "quintuple" into the "descriptor" 
LR2Sz05.3)7- Sor] ite may2 ada MrowsoL=! to any mode, 


ccnstructing a new descriptor with one quintuple. 


Though a proposal to remove proceduring from the 
language [(4-1Q203A(7)] has met with sone approval, 
proceduring may still be found in the mcdified syntax. 
Proceduring is an interesting concept and is frokably no 
mere ccnfusing than some of the other coercions (or 
extensions, for that matter). The effects of proceduring on 
implementation *111 be discussed in Chapter IV. fhe 
following is an example in which proceduring is both elegant 
and useful. 

ce andf = (becl a, Eroc bool b) bool: (alt{false) ; 
bool:=bool1 andf booli2 andf bool3; 
where "bocl", "bool1", "kool2", and "bool3" are of boolean 
mode. Without proceduring, the “assignation” [R8.3.1] would 


have to ke written as follows; 


bool:=bool1 andf£ (bool: bool2 andf (bool: kool3)) . 


In order to make possible the construction of a 
sublanguage which does not contain any proceduring, a 
modified versicn of [13-prcefosal 1] is adcpted. Routine 
denotaticns become units and are no lcnger packed between an 
open-symbol and a close-symbol. They do not become 
confrontations, because, in the following context they would 


then not ke called; 
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es egnvotdenskip; aut: 
"Casts" [R8.3.4] are made to lock different frcem routine- 
denotations with the "cast-of-symbol" becoming ex or :: (if 
ccnfcrmity-relations disappear). Also the "heap-symbol" is 
made mandatory ir "global-generators" [R8.5.1] [19-proposal 
2jptuthe, reasonssftior”*ethishsehbeing, Onainly pdidactic,inwith 
orthogonality keing improved at the same time. It is then 
possible to allow assignations as "boundscripts" 
[k8.6.1.1.1] because boundscripts may now be units without 
causing ambiguity. For example, afint:i] has a routine- 
denotation as a "subscript" [R8.6.1.1.1], af[heap iant:1] has 
a glckal-generator as the lower bound of a "trimmer" 
aoe Osileal sti), andes a(anteexiite se amcUDSGr 1 pc. 


Similar examples with assignations are a[i:=int:j], a[i:= 


Chapter 9 of the Report introduces extensions, which, 
when applied to a program in the strict language, produce a 
program in the extended language. Extensions, which mainly 
ccncern "comments", declaraticns, “repetitive statements", 
and case-clauses, cause ALGCL 68 programs tc look more like 
programs in certain other languages. However, extensions 
seem out of place in a rigorously defined language such as 


ALGOL 68. Their method of description employs ne formal 
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techniques and, conseguently, automatic methods of 
recognizing a fregram in the extended language are 
impossible. If the features introduced Ey extensions are 


deemed necessary to be in the language then they should be 
formally defined as part of the strict language. Some 
features, for example "options" [R3.0.1.b]j, presently in the 
definition of the strict language are no more than 
extensicns anyway. Thus all extensions were studied and only 
those considered necessary are included in the syntax of 


Appendix A. 


Ccmments [R9.1] are included in the syntax Ly means of 
"tokyls" which are one syntactic level higher than 
"symbols", the terminals of the strict language. A tokyi may 
optionally be replaced by a comment-sequence followed by the 
particular symrkcl. Care has been taken so that ccmments do 
not appéar within comments or string- Or character- 


denotations. 


Contracted conditional-clauses, case-clauses, and 
conformity-case-clauses [R9.4] have been unified with 
conditicnal-clauses as mentioned in secticn five of this 


Chapter. 


A syntax for repetitive statements DRes 31) is 
incorporated so that a "“repetitive-clause" becomes a 
producticn of unitary-clause, repetitive statements being 


specified by the extension as "“strong-unitary-void-clauses", 
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The form of the repetitive-clause has been altered so as to 
allow the definition of a meaningful range incorporating the 
"controi-identifier", the “while-clause" and the "do- 
clause". Also all constituent clauses become serial-clauses. 
Fer example; 


from 4 by 2 to 100 for i while condition do skip; . 


The extensions concerning bounds [R9.2.f£] and sub- and 
bus- symkols [R89.2.g] are easily incorporated into the 
syntax while another extension [R9.2.d] has been obviated by 
the acceptance cf a proposal concerning routine-denotations 


as in section seven of this Chapter. 


The remaining extensions concern declarations and as 
demonstrated in [4-LQ203A(4) ] and [20-proposal 2] cause some 
problems, especially since the order of applying extensions 
kecomes important. If extensions had been formally defined 
it is doukttful that these complications would have arisen, 
and those extensions which cause confusion have not been 
included in the modified syntax. Extensions across a 
"“becomes-symbcl" are also excluded. For examyple, 


nt:=(int a) int: skip; 


loc proc (int) 

may not be contracted to 
icc Eroc:=(int a) int: skip; . 

In cases such as this the left-hand side is a generator and 

not a formal farameter as in similar contractions. 


Generators cause the reservation of storage but this will 
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have to be postponed, if such extensions are allowed, until 


the contraction has been "“uncontracted". 


Many semantic modifications accompany the inclusion of 
extensicns in the syntax. For example; 
(lsc =n+ janie as); 
75a contraction of 
(is Sata eC peels ne le Lt tapes 
using present extensions. Thus, the assignation in the 
contraction should be elaborated twice contrary to what a 
user may suppose. In such declarations actual-bounds should 
be elakorated only once and the semantics will have to 
reflect this. Perhaps a suitable method of semantic 
description would regard the whole declaration as a 
"structure", the actual-declarer of which is elaborated once 


only. 
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CHAE Tian 1. La 


A CNE-PASS CCMPILER 


3.1 Intreduction 


ALGCL 6& is defined in terms of "actions" performed by 
a “hypothetical computer" Ven 2ere le A model of this 
hypothetical ccmputer, using a physical machine, is an 
“intplementaticn" of ALGOL 68. A sublanguage of ALGOL 68 is a 
language whose particular-programs are particular-programs 
of ALGOL 68 and have the same meaning. In an implementation, 
the particular-program may be translated into a program in 
the machine language of the physical machine. This 
translaticn is best performed by the computer itself using a 
special program called a “compiler”. The particular-progranm 
cr "source program" is written in the "source language", 
ALGCL 68, and is compiled into the "cbject prcgram", written 
in the “object language" or “cbject code" of the physical 
machine. The translaticn of the source program into the 
cbject prcgram occurs at "compile-time", and the execution 


of the ckject program at “runtime". 


A compiler must perform an "analysis" of the source 
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program and then a "synthesis" of the object program. 
Schematically, the logical parts of a compiler are shown in 
Figure 3.1. This diagram represents a logical connection, 
rather than a time connection of the compiler parts. The 
processes can ke performed in the order depicted by Figure 
3.1, or they could be performed in a parallel, interlocked 
manner. A ccmpiler which performs these processes in the 
former fashion is a "multi-pass" compiler; one that performs 
them in the latter fashion is a "one-pass" compiler. In a 
one-pass compiler, the source text is processed once, and 
executable okject code produced immediately. Actuaily, very 
few ccmpilers are truly cne-pass. For example, the place in 
the program to which contrcl is to be transferred Ly a jump 
tmstruetion t(in IALGOh<h6 ~rdo prtourhpl woke exampheje 135 senot 
generally known when the jump instruction is encountered, 
unless the transfer point precedes the jump. Thus some sort 


of "back-tracking" is necessary. 


Scme languages are structured so that they cannot be 
translated in a reasonable fashion by a one-pass compiler. 
ALGOL 68 is, in fact, cone of these languages. Restrictions 
to the language are necessary and these will be discussed in 
secticn three of this Chapter. Also in ALGOL 68, there are 
cases where it is necessary to scan ahead in the source text 
in order to resolve some indeterminations. These cases will 


be discussed in Chapter Iv of this thesis. Thus the proposed 
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implementation is actually one-pass with some "cheating", in 


the fcrm cf kack-tracking and locking-ahead. 


In a cne-pass compiler it is not necessary to store, 
fer any length of time, intermediate object code that might 
be produced by a multi-pass compiler. Nor, in a one-pass 
compiler, should any functions of the ccmpilier be 
duplicated. For example, in a multi-pass compiler, certain 
secticns of the source text, or slight modifications thereof 
may te analyzed more than once, each time ky a different 


pass. 


In a multi-pass compiler, on the other hand, it is 
possitle to produce mcre efficient object code fcr larger 
subsets of the language and presumably the language itself. 
Thus the choice of a one-pass or a multi-pass compiler 


depends largely on the objectives of the implementor. 


For the historical developments and a discussion of 
the general techniques of compiler-writing, the reader is 
referred elsewhere. In particular, [22,23,24,25] were used 


in the course of this investigaticn. 
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COMFILER 
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FIGURE 3.1. Logical Parts of a Compiler. 


Objectives 


The objectives of this implementation are as follows: 
(i) to implement a one-pass compiler for a large subset of 


ALGCL 6€. 


(ii) to be modular in design. 


(iii) to provide facilities for good error diagnostics and 


runtime "debugging", 
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A one-pass ccmpiler was chosen in order to provide 
fast turn-arcund (time taken to compile and execute a 
program) for normal, small-to-medium size ALGCL 68 programs, 
the type of programs one finds in an educational 
envircnment. It is planned that the whole system should 
reside in core, the only input/output being the source 
program itself and the irputyoutput which the source program 
carries out. Only minimal local optimizations of object code 
will be made, producing relatively inefficient cbject 
programs in aksolute machine language, which the system will 
execute immediately. Thus no expensive linkage-editing 
(combining of object programs) and/or loading {separate 
insertion of a relccatable object program into the physical 


machine) is necessary. 


The modular design is employed so as to keep the 
various tasks of the compiler distinct and weli-defined. It 
is proposed that some modules be included or excluded 
depending on the setting of particular program switches. In 
this WayalitSuilitbe possible to study the relative effects 
cf scme cf the features of AIGOL 68 on both compile-time and 
Funtime *eFficiency;° and**+onsthe> size® of**the*t compiler’ 
Consequently, information concerning the development of 


sukblanguages for ALGOL 68 [26] may be obtained. 


Because of the possible use of the compiler in an 
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educational environment (or in any environment where 
programs are Leing developed and not just run), it is 
necessary to provide facilities for good error diagnostics 
and runtine dekugging, all tco often neglected in the design 


cf ccmpilers for high-level languages. 


323 Restrictions 


— — — eS SS Se 


The fcllcwing restrictions are made tc ALGOL 68 in 


order to facilitate the cne-pass implementation. 


1.) Add the Ecllowing context condition [R4.4.1.d] 
concerning identification. 

od) No proper particular-progran in the sublanguage 
contains an applied eccurrence of a mode-identifier 
{indication-aprlied occurrence of a dyadic indication, 
Operatcr-aprlied occurrence of an operator) which is 


textually btefcre its defining occurrence." 


Consider the following example; 
bkegin real x; 
proc p = void: begin a x; «+. end; ... . 
iieeeetlictlLyalett=to-rignt Scatectestilo erect tt Se NOT 
known whether a x is a "monadic-formula" [R8.4.1.g] or a 
declaraticn cf a local variable "x" until the defining 
cccurrence of a is encountered. It is for this reason that 


Maillcux suggests in (27] a special pass to process mode- 
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indications, dyadic-indications and operators, tased on 
their defining cccurrences, With the identification 
condition above, a in the example could only be a mode- 
indicaticn Ett a ccntext condition to be intrceduced later 
will also exclude this case. The fact that this example is 
illegal will ke reported at inspection of a ina x because 


no defining occurrence of a has yet been encountered. 


The problem with mode-identifiers is okvious from the 

following example; 

Fegin int i; 
begin proc p = void: begin i:=1 end; 
anovnigt: that 

Again, in®tacwleft=to-right scan;e®rcitrdis Snot) known! iwhen 
inspecting the applied cccurrence of "i" in i:=1 whether it 
identifies the defining cccurrence in the first line or one 
that occurs later, as is the case here. It is for this 
reason that Mailloux suggests a further pass to process 
identifiers based on their defining-occurrences (now 
recognizakle due to the frofosed first pass) in identity- 
declarations (and in “labels” [R6.1.1.k] though these 
"label-identifiers" [F4.1.1.b] are treated separately here). 
The example is illegal in the suktlanguage and this should be 
reported cn inspecting the defining occurrence of "i" in int 
i cn the third line. It is illegal because an applied 
occurrence has cccurred textually before its defining 


eccurrence kut matters are further complicated because 
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another defining occurrence was applicable (during the left- 
to-right scan) when inspecting "i" in i:=1. Thus some record 
must ke kept cf applied occurrences of mode-identifiers, and 
for similar reasons, of operators and indications (including 
mode-indications) so that errors such as this may be 
reported. Mailloux points out that policing this item is 
likely to be almost as costly as keeping the first two 
passes and the unrestricted language. Methods of recording 
applied occurrences exist and one of these will be described 


in Chapter IV of. .this thesis. 


With the introduction of the above identification 
condition, a modification of that presented in [27], the 
identification problems concerning mcde-~identifiers, 
Operators, and dyadic-indications are resclved, requiring 
"things tc ke defined before their use". As noted by 
Feet Gare this is not a serious restriction in that 
"sensible" programmers “always" declare things before their 
use. However it is now more arduous to write mutually 
recursive operators and frocedures. It is fortunate that 
ALGOL 68 leaves a loophole which enables this to ke done 
despite the reguirements of the above identification 
condition. For example; 

op p = (int a) real: begin ...; g 4; «... end; 


br) int: begin ..-; p (b-:=1); «.. end; 


" 
~~ 
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ict 


Qzya 


is now illegal, but may be rewritten as follous; 
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The reason that mode-indications were not included in 
the above condition is that it is possible to have mutually 
recursive modes in unrestricted ALGOL 68, and no _ loophole 
such as the one employed above for operators exists to 
enable the alternative declaraticn of these mutually 
recursive mcdes. FOr example, the following mode- 
declarations cannct be rewritten to avcid recursive 


definition: 


MOCERAs=SSCLUCt (Ler Ded la letv cede a teas)\is 
mMOdewbhe=SStLUCt (rel cubl srelea be real. bs):, 


MOGEeC =eStruct (ref a cl, rele bec2, sboO LNCS) si. 
Thus another context condition is defined which allows 
recursive mode-declarations but requires definition before 


actual use. 


2. Add the following declaration conditions [R4.4.4]. 

ed) ai an indication-afprlied occurrence of a mode 
indicaticn identifies an indication-defining occurrence of 
that mode-indication, then it must also ‘ultimately' 
identify cne or more indication-defining occurrences of 


mode-indications found by the following steps: 
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Step 1: Each mode-indication is said not to have been 
encountered; the given indication-applied occurrence is 
considered; 

step 2: The considered {indication-applied} occurrence and 
all mcde-indications which are the same terminal production 
of 'MODE-mode-indication' are said to have keen encountered; 
the indication-defining cccurrence identified by the 
considered occurrence is said to be ultimately identified by 
the given cccurrence, and is itself considered; 
Steprranceiivithe iconstituent declarer of the node- 
declaration, of which the considered {indication-defining} 
occurrence is a constituent contains one or more indication- 
applied occurrences of net yet encountered mode-indications 

(cther than occurrences ccntained within a boundscript 
contained within that declarer), then each such {indication- 
applied} occurrence of each such mode-indication is 
considered in turn, and for each one Step 2 is taken. 

e) No proper particular-program in the sublanguage 
contains a generator (a formal-parameter) whose constituent 
declarer contains a mode-indication (other than a mode- 
indicaticn contained within a boundscript contained within 
that declarer) which ultimately identifies an indication- 
defining occurrence of a mode-indication which occurs later 
in the textual order than the given declarer." 

This condition is taken almost exactly frcem [28] and is 


actually simpler than it appears. For example, in 
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the applied cccurrence of a lngtheselastevlinejuvgitinately’ 
identifies the indication-defining occurrences of a, then 
real (in the standard-prelude) and b, then bool and c. In 
the identity-declaration a x (contracted from ref a x = 
ioc a) a is a generator and since all indication-defining 
occurrences which it ultimately identifies textually precede 


it, the example is thus permitted. However, 


would not be allowed since when the declaration a x is 
encountered a ultimately identifies real and b and no 


defining cccurrence has yet been encountered for Ek. 


It would not be in order to reguire that defining 
cccurrences of label-identifiers precede applied occur- 
rences, for then 

ea OOMUC Li mente slic Male eens 
would be illegal. In ALGOL 68 it is possible to cmit the go 
to in a “jump" (88.2.7.1.c] and this can cause preblems for 


the one-pass compiler. For example, in 
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p: skip end; 
it is not known whether the seccnd occurrence of p is a jump 
or net. Thus go tos will be required in every jump. This may 
be more restrictive than necessary trut it is less 


complicated to explain. For example, in 


there is no proklem and the ge to has been omitted. However 
the requirement of go to alleviates protlems such as 
recogrizing (11:12), by reducing the number of possibilities 
for this ccnstruction. The following example, 

PF: Skip; 

fegin go to Pp; 

Pp: skip end; 

is legal for latel-identifiers unlike the corresponding 
example fer wmcde-identifiers. The compiler must carry out 


kack-tracking in crder to resolve examples such as the 


Vatter. Thus the third restriction is as follows. 


3. The syntax for a jump becomes [R8.2.7.1]} 


"c) MOID jump: go to symbol, label identifier." 


G4. The final restriction concerns parallel processing. 
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There 1s no parallel processing and no use is to be made of 
the tact that the user might write commas’ instead of 
semicolons, giving "collateral elaboration" PRAT 2ck Soave 
R6.2.2]. The "parallel-symbol" will be ignored and if the 


"sema-symbol" is used, an error message will ensue. 


3.4 Implementation Dependencies 


—— ee ee i se ee ee ee ⸗ 


When designing a compiler for ALGOL 68 it is necessary 
to choose the number of different lengths of integers and 
reals to ke inplemented. ALGOL 68 makes provision for an 
infinite numrer of lengths but to actually have an infinite 
numker cf dlengths is, of course, impractical. It is then 
necessary te choose "internal" [R2.2.1] representations for 
integral and real modes so that long modes and the newly- 
added short mcdes may also have representations. AS an 
example cf this, consider a physical machine with "full- 
word" and "half-word" integer arithmetic and "fuli-word" and 
"douktle-word" real arithmetic. Then it would ke possible to 
select a fuli-wcrd to represent integers and a half-word to 
represent short integers. For reals, real and long real may 


Gorrespond tc full- and double-words respectively. Certain 


choices regarding kits and bytes must also be made. 


In the standard-prelude of the Report there is a 
secticn entitled "Envircnment Enquiries" [R10.1] in which 


definitions are given for a number of identifiers and 






















. Pi M 
10 «6sp6hem od oF al aay on bas p ' = 


* 


to bseteat esmmoo otiaw — ot 1 se 
ged fal] *not¢s1odsile laze. 
odd ti bee Beronpd od iLiw tote 


alana at si 89 aooua 103 tof ianos 5 paingtsot nod F 


‘7 


bare aqepatat to edspast pall Beene to sedeur. edt I< 
438 eS 


a5 3103 soieivorzg — * LOath sbedaomotgnt ed ed els + 


¢ ¢ 
a4 as 


eviaizat as eved —J— os — —— — — —— to “teva * be nd 


ia ae | 
nedt ef +1 -Ls.tdosagmi eeREHOS to ret adzonet 1 reda@urn 
* see 

iy 

toi aenofistaesaeages [f.§& 0) *f£earosai" e200d> > YI 

t gl 


-yiven od+ bas 2ebom pnol sand o2 eabon ise bas ees 


nb 2@A ,anoltssneestqze2 oved oasis Ysa es boa 2260 


~ffuk" dtiw enidosa Inoteyda & zsbiano abd sat 


bos "baow-ilvt" bas sksenasias — baov⸗ at a · ae Re 


V, 1 


o¢ sidtazog od biuow +i aed? ob dondsizs keox d aou⸗ ot⸗ 
he al 

od baow-2lad s bas — aoaeaasa oF basw-f103 s Pit 
ar) by: i a a) paid | be: 


vse 622 pgol bas ten 2 lsex rot -a9pesai pa oe ⸗asas⁊ 
oe A is te! od) 84 
niss3s9 . (levi toeqaes abrow-eiduob bos ~Iiv2 of — y 2 
oo q 


966m od opts sa08 ee2td ba ® atid cekpeenn al 
| 2 See eee / RE 


6 2 oe1s9de saoged eft to —— 
doidw at (1 .0°8] *aekzhwpas “guowaoaivnt™ 


a 4 ee 
bos azsttisaybt to — a9 J 
e 


—* my . 





46 


Operators that enable the programmer to ascertain certain 
features which are dependent on a particular implementation. 
The function of the identifiers is to supply information 
such as the number of different lengths of integers (the 
identifier "int lengths") and reals ("real lengths"), the 
Humber of widths of bits ("bits widths") and bytes ("bytes 
widths"), the largest integral value ("max int"), the 
largest long real value ("long max real"), the smallest real 
value that can ke meaningfully added to or suktracted fron 
che ("small real"), and sc forth. With the introduction of 
Short medes, it is likely that additional environment 
enquiries will ke made available so that, for example, the 
humber of short integer lengths and long integer lengths may 
be ascertained. Iwce operators, abs and repr are alisc defined 
such that aks "a", for example, will return the integral 
eguivalent of the character “a", and repr 6, for example, 
Wii eyreturn) that character “x","i1f it exists, ssuch that tabs 
"y" = 6. Further envircnment enquiries in the standard- 
prelude concern transput, for example, enquiries ccncerning 
"Channels" {R10.5.1.1] and "external" PRO eette 
representations [R10.5.2.1] (the number of decimal digits 
required to represent accurately the chosen internal 
representaticns). These features all depend on the 
particular machine on which ALGOL 68 is being implemented, 


as well as depending on the implementor himself. 


7: — 




















— — ——— — 
-noitetagasigai mivottreg tap tantangeh ne site 
soivtsg2otait yigaue of et atetitsasbs oft t0— it 
od?) atopetni to adtpnsd jostetttb 20 zedaua ads — 
add ,{"adtpael Leer) 2hseor bas. ("edsenel sabe soistoneti 
azedyd") aotyd bas (“ad¢biw atid") atid Yo adtbéw to. 
edt ,("tai xea") oplev Isgpetak deopisis ade 4 
Iso1 taelisma sdt , (“Iset xem ager ouisy Lee1 pool — 
no1i botosatdve 20 oF bobbe viLurentasen od os2 — 


ee ad 
to noltowboatat oda dd iw .Ad20% 22 bas —R L = 
taeanotivas Lseaoitibbs sted lexis at +t ,eebom g2rode 5 


~ =e «<2age ao 
edt ,slqusxe 20k ,tedt o@ sidsiisvs bss. aes tone eink i 
yen edtpoel aspetat paol bas adseask s9pesat oas 
boniteh cals 216 gggz bas ads .210sHI9I0 ae? sbanies 


isupesni adt aiuge1 {itv ,elqmaxe tot ,"6" 


‘ 34 pe oi 

.sigasxe 70% ae agez bes ,"s8" itet0oszsd9 ods to 

9 — 
ads +sdt dove — st = ."x" setosisdo teds — 

ey tae? \2 a 

vbas bat ⸗a aut Pr aotziopas sagen oyteie — F 

— —— 
painzeonoo esinispase voiquare 102 .tuqensat aas daoo dusaq 

14 owe Ml 

Ch.S.S8 } Asnasaxs· bus { — 9 "aleansdo" · 







4D dig: Lae A ‘he an 







astpib Llemioeb to redaen — ———— oes agsa 
‘een 4 Ee — — —A 
leniesat aseod> ads ‘vhosexpa08 ⸗nsas? — beriupsa 


edt ao baegeh Ils castles | — oni 


,boinousigqei pabed. #1. 9 £0016 dokdw a 
-ifeoaniad. ꝛe⸗acaoiaoi· ↄv⸗ 





47 


The following example illustrates a problem. Let the 
number of lengths of real be 2, i.e. "real’°lengths = 2", 
correspcnding to real and lcng real, say. Then what if the 
programmer writes 

long long real x := long long max real; ? 
In the proposed implementation, a warning will be given in 
such cases and the above assignation will be compiled as 
long reali x := long max real; 
with "x" being considered to be of mode long lcng real, in 


complete accordance with the Report. 


The number of longs able to precede a declarer depends 
largely on how modes are stored. For example, long long long 
real may te stored with a special counter in its de- 
scription, which in this case would be set to 3 (for shorts 
this counter could te negative). The situation is similar 
for the number of refs and the number of rows (i.e. "row-"s 
or “row-of-"s) that may precede a mode. Note however, that 
cuntime considerations will certainly restrict the numker of 
rows. In the akove case, where a counter is used, the 
Maximum number of Jlcngs permitted would depend on _ the 
maximum value which the counter could assume. Other methods 
for storing mcdes exist [29] where the number of dJlongs, 
on the size of a counter. In the proposed implementation, 


the restricticns will not be severe. The number of longs or 


Shorts will depend on the size cf a counter, the number of 
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refs on table size, and the number of rows on runtime 
considerations. No syntactic changes reflecting these 
restrictions have been inccrporated into the syntax of 


Appendix A. 


It should ke noted that the Refort allows an infinite 
number of identifiers to be defined within the standard- 
prelude. For example, “long max int", "long long max int", 
"long long long max int", etc. There will have to be a 
special technique for storing and looking up _- such 


identifiers. 


Two other possible implementation dependencies are the 
length of string-denotations and the maximum number of 
letters or digits permitted in identifiers and indications. 
In the proposed implementation, it is hoped that these 
limits will depend on table size (such entities must be 
stored) rather than on some feature of a particular physical 


machine. 
















gaeds paisoafie: eepasdo 
to xetaya oct otat besexogroont 


stinitnit as awolls sr099R eds sea badon | X — 


-bisbast2 eft aldtiv Doniteb ‘sd 08 ‘aastettnabi $0 3 = 

"sat xem pnol pmol” , "tai xsa pnor” «9LqGesxs © 10% eye 
oy ary! Tie? ae 

5s od ot aved Lliv ezedtT .3t9 ."tar xem pool eat 


dowe qu paktdool bas paizote tot a tal “i 
a vanet 


—— SATE 
eit 918 29ionsbaey ob sol tasnous tga — E oe ow? 


20 xeddae auvixen odd bas — com 


-anoiseotbat bas ers itisaehi ai bortinizeg astgkd 30 — 
2as ds sods begod at $2 enoizarsagacigqni | ‘beacqorq, 7 9d9 


ol a 67 . a 


ad taum eeititaa dopa) este eldest ao basqed. tte 8! * iL 


bs ieee 
dost vy 
4 x 


— 4 


tsoieydg — * 6 to ⸗v⸗avt amoa mo ands reaver d 


&i 2 ep rhe 


J ga — oe 

Vit 9 1 Fl et eR ore 
4 ) ._&th ald - ‘jon 
heart ae | 


49 


CHAPTER IV 


A ONE-PASS IW¥PLEMENTATION 


4.1 Intrcoduction 


This Chapter proposes an outline for a one-pass 
ccmpiler for the sublanguage of ALGOL 68 detailed in Chapter 


Tigseotrethisséthesis. 


ALGOL 68 incorporates many concepts from other 
programming languages, and basic techniques developed for 
the compilation of these other languages are sometimes 
applicakle or perhaps partly applicable to the design of a 
compiler for ALGCL 68. For instance, methods of handling 
structured values are kncwn, at least partially, from 1list- 
and string-prcecessing languages (for example, LISP {30] and 
SNOBOL {31]). Runtime organization of ALGOL 60 
inplementations employs stack techniques which, in 
principle, may be used for ALGOL 68. However other concepts 
intreduced in AIGOL 68 make implementation difficult, and 
cause proktlems nct previously encountered in the design of 


compilers for high-level programming languages. 
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The remainder of this Chapter discusses the major 
components of the compiler and system, giving techniques or 
references to techniques for scme of the tasks which must be 
performed within these components. Particular emphasis is 
placed on those techniques that are peculiar to a one-pass 
but not a multi-pass compiler. 


4.2 Lexical Analyzer 


The lexical analyzer cr "scanner" is that part of the 
ccmpiler which scans the characters of the source program 
from left to right, constructing the complete symbols of the 
program, as well as composing denotations and identifiers. 
In general, the operations of lexical analysis are well- 
defined and standard finite-autcmaton technigues [23-Chapter 
3, 321 will be used for the scanner of the pfrcposed one-pass 


compiler. 


More specifically, the lexical analyzer must perform 
the following functions: 
1. Do the bookkeeping required to know where in the input 
stream the ccmpiler is oferating, fetching new source 
records and listing these if required. This part of the 
scanner may Le somewhat intplementation-dependent due to the 
inputyoutput cferations that are involved. 
2. Ignore comments and insignificant blanks (those not in 


string-denotations) that cccur in the source progran. 
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3. Perform acticns specified by "compiler directives" or 
perhaps as suggested in [R2.3.c] "pragmats". Eragmats enable 
the user to specify, for example, whether or not a listing 
of the source text is required, the column range of source 
records, cptional features that the compiler provides, etc. 
4. Ccompcse symbols from the characters of the source text. 
These symbols will have representations according to the 
representation language given in Appendix C of this thesis. 
5. Ccmpose identifiers from sequences of letters and 
digits. 

6. Ccmpose and convert to internal form all denotations 


except routine- and format-denotations. 


In the representation language of Appendix C, care has 
been taken to ensure that a sequence of characters may be 
uniguely partitioned into symbols. For example, 

ats=s:/?%*b 
may te partiticned uniquely as follows: 
a +3= a37 2* b 

where “a" and "b" are mode-identifiers, +:= is a dyadic~ 
operator and -:/ and ?* are mcnadic-operators. This unique 
partitioning is desirable because the number of 
possibilities for syntactic analysis is thereky reduced. The 
syntax given in Appendix cC for "dyadic-indicants" and 
"monadic-indicants" [R4.2.1] which consist of special 
Characters (characters other than letters or digits) are 


expressed using regular grammars [23=Chapter 3.) to 
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facilitate ccnstruction of the lexical analyzer. 


If a given representation represents more than one 
symbcl then scme ccmplications arise. For example, +, -, =, 
ie ls ,SuRk Ut, the, fe hcand éhwearoesésuch, representations 
according to the representation language given in Appendix 
Co FOr i+t,°-, and = -the’ complications are minor ,:and,, are 
easily resolved hecause of the limited contexts in which the 
corresponding symbols may appear. For instance, +(-) may 
appear as a representaticn cf the "plus-symkol" ("minus- 
symbol") in an “exponent-part" of a "real-denotation" or in 


a "sign-frame" within a format-denotation. +(-) may also 


appear as a representation cf the "add-symbcl" ("subtract- 


symbol") within formulas, operation-declarations, or 
priority-declarations. These cases are easily 
distinguishable, as are the cases for = which wiil not he 


detailedevheresr For ji,oltyzoein andcoutethe)}complications,are 
also minor and the immediately surrounding context will 
determine exactly which symbols these representations 
represent. Complications with ( and : are more serious and 


these will be discussed in the next section of this Chapter. 


4.3 Syntax Analyzer 


The syntax analyzer or “parser" analyzes the source 
program, decomposing it into iAts’ constituent farts, and 


calling routines to perform specific functions dependent on 
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the type cf construct currently being analyzed. The syntax 


analyzer forms the nucleus of the proposed one-pass 


ccmpiler. 


Most of the methcds developed for the syntactic 
analysis of programs presume that the language in question 
is at least bcunded-context [33]. ALGOL 68 does not fulfil 
this conditicn and is not amenable to the usual, context- 
free analysis methods [23-Chapters 4,5,6]. Research on the 
automatic analysis of ALGOL 68 programs, guided by the two- 
level syntax, is currently being carried out [34] but such 
techniques will net be used in the proposed implementation. 
The methcd of parsing selected for the proposed compiler is 
that of a "tcp-down recursive descent" analyzer based on a 
context-free syntax which describes a superset Of ALGOL 68 
and which appears as Appendix B of this thesis. A context- 
free grammar fcr the language described by the two-level 
syntax of Appendix A could not be generated automatically 
from the syntax of Appendix A, since the two-level syntax 
generates an infinite number of context-free production 
cules. It is for this reason that the syntax cf Appendix B 
describes a superset of the language described by the syntax 
of Appendix A. No preblems arise however, as improper 
programs which will be parsed as correct prcgrams in the 
superset, will ke detected by routines called by the syntax 


analyzer. During the development of the context-free syntax, 
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much use was nade of {35}. 


The top-down method of analysis known as recursive 
descentit 972) nctsd described svasifollonsy,e,;form avgiven 
ncnterminal U, there is a recursive procedure which parses 
phrases for U. ‘The procedure is told where in the source 
text to regin locking for a phrase for U, and hence the 
method is “goal-oriented" or "predictive". The procedure 
finds the fphrase by ccmparing the text at the point 
piovedated= With *right parts Sof Sproduetion erulies Sf0ce0, 
Calling cther frocedures to recognize subgoals when 
necessary. Thus the subject language syntax is built into 
the analysis program. Some advantages of this method are 
teateecules ecCane bes rearranged) toe Liteethes necdseor the 
procedures and to improve efficiency of the parsing process, 
and that cther routines for further syntactic analysis, for 
checking context conditions, and for code generation may he 
inserted anywhere within a procedure and not just at the end 
when the phrase is detected. Two disadvantages with the 
recursive descent method of analysis are that more 
programming and debugging is required and that modifications 
to the syntax are harder to incorporate than if other 
methods of analysis were used. However, neither of these 
disadvantages is thought to ke severe. In particular, in an 
effort to reduce compile-time, the method of recursive 
descent was selected and methods using Floyd-Evans 


Production Language [36, 37] or modifications thereof, which 
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are employed in scme other implementations of ALGOL 68 Tao, 


39], were rejected. 


It should be noted that the syntax analyzer is not to 
be constructed directly frem the syntax of Appendix B but 
rather frem a contracted syntax in which rules are ordered 


to obtain a minimal amount of back-tracking. 


The syntax of Appendix B contains twe intrinsic 
ambiguities. The first involves certain constructions which 
may tke parsed as either slices or calls; for example, 
a (., yY). Houever, in Chapter 117— 0of this thesis, a 
restriction was imposed to ensure that the defining 
occurrence of a mode-identifier would precede any applied 
occurrence. In an example such as a(i,j) then, the defining 
occurrence of "a" should have been encountered and whether 
ERSVT OL Ven esCCHStructions is “a slice o1 Jae cali «may She 
determined Fy inspecting the mcde of "a", If no defining 
occurrence of “a" exists, then the program containing the 
construction is not a proper particular-program in the sub- 
language and an error message will result. The second 
intrinsic ambiguity in the context-free syntax involves 
"indicants" [K4.2.1], mcnadic-indicants, and dyadic- 
indicants. For instance, a may be a terminal production of 
any of these and consequently a a and (1:1) a a each allow 
two interpretations. In the case that a is an indicant, the 


two ccnstructions above are identity-declarations. If a is a 
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monadic-indicant in the first constructicn cra dyadic- 
indicant in the second construction then the ccnstructions 
are formulas. This ambiguity is)» resclved by way. of 
restrictions imposed in Chapter III, by requiring mode- 
indicaticns, monadic-operators, and dyadic-indications 


(dyadic-operators) to be declared before use. 


Majcr ccmplications in the syntactic analysis are 
caused by the indeterminations that exist concerning left 
farentheses and right parentheses (less importantly, since a 
right Farenthesis must have a corresponding left 
parenthesis). In ALGOL 68 parentheses are "overloaded", that 
is, there are many ccnstructions which farentheses may 
delimit. According to the syntax of Appendix A and the 


representation language of Appendix C the left parenthesis 


is either a representation of open-symbol OL a 
representation of "“brief-conditional-begin-", “brief-case- 
begin-", or "“brief-conformity-case-begin-symtol". A left 


farenthesis encountered in a left-to-right scan of the 
source stream may fall into one of two categories as 
fcllows: 

(i) A left parenthesis, the left context of which is 
sufficient to determine the construct which the left 
parenthesis precedes. For example, in each of struct (int i, 
real r) and struct s = (real a, b) the struct to the left of 


the left parenthesis determines the following construct as a 
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Tie CS= declaratornt's (Rie Infos Ob}s 
(ii) All other left farentheses. The fcllowing are 
constructs (with examples) that a left parenthesis belonging 


to this category may ktegin: 


vacuum Q 

routine-denotation (a “a)p bet@skip 
actual-declarer (11:12)a a 
vVirtuai-declarer (,:)a a 

closed-clause (a a; skip) 
ccllateral-clause (skip, skip) 
conditiocnal-clause (bool|skip|skip) 
case-clause (int{skip, skip|skip) 
contormity-case-clause (union| (a a): skip{skip) 


whereo;a andi)b are node-indications, andi "11". "12%,) "a", 
"bool", "int", and "union" are identifiers. Note that the 
left parenthesis beginning a specification (in the new 
conformity-case-clause) belongs to category (i) as the 


specification is distinguishable by context tc the left. 


When analyzing the program, context to the left is 
taken into account by the mechanisms cf recursive descent 
and so farentheses belonging to category (i) cause no 
ccmplications. However, in general, an infinite context to 
the right may be required to distinguish the parentheses in 
category (ii). Iwo approaches to this problem will be 
discussed. First, bie is possible to consider all 


interpretations of the construction in parallel, eliminating 
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cases as analysis of the source text proceeds. Finally when 
context to the right has eliminated all Eut one case, the 
effects of the incorrect interpretations must ke cancelled. 
This approach is very difficult for a one-pass compiler and 
would require extensive back-tracking. The second approach, 
the one adopted in the proposed implementation, is tc employ 
a look-ahead algorithm which will be initiated at left 
parentheses belonging to category (ii) and will divide the 
Gonstructs scintcctat icleast . efour ceoclasses peasi;-follows: «(a) 
vacuums, (b) virtual-declarers, (c) routine-denotations, and 
(d) all cther constructs beginning with left parentheses 
belonging to category (nil) sit’ is Dot 
immediately necessary to distinguish all cases mentioned in 
category (PiNS censtructrathat remust spbe 
discerned is the routine-denotation. For example, consider a 
aein (atageskip) andn(a apabt skips Inetheesfirst: case, oha 
closed-clause, a ais an identity-declaration. In the second 
case, a routine-denotation, a a is a formal-parameter. These 
constructs require different actions by the compiler and 


thus the cases must be distinguished. 


Note that if array was reguired before declarers 
specifying “row-of ="! modes then virtual- and actual- 
declarers would net appear in category (ii) above and 


complications would be scmewhat reduced.. 


The look-ahead algorithm is simple in principle and is 
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based primarily on the particular symbols that may appear 
between left and corresponding right parentheses, excluding 
any text surrcunded by inner parentheses. The following are 
the particular symbols for the first three of the above four 
classes: 

(a) vacuums: empty (i.e. no symbols at all). 

(b) virtual-declarers: empty, ",", :, "mode-words", mode- 
indications, mode-identifiers, tags. 

(Cc) routine-denotations (i.e. fcrmal-parameter-packs): ",", 
mode-words, mode-indications, mode-identifiers, tags, {, ), 
{+ ]+ (4+ /)e 3, Elex, either (these latter three may appear 
soy ecetween (/sand-/).or [Pyands )*) : 

Mode-words include the following symbols which are 
aasoGiatede swith “declarers:eprocy) union, eStruct,=ret, 10nd, 
short, and void. For vacuums and virtual-declarers the look- 
ahead algorithm scans past the corresponding right 
parenthesis, to distinguish petween cases such as () and 


()real. AS soon aS a decisicn is reached the algorithm 


terminates. Extra information is utilized in ®’ethis 
ccnnecticn; for example, the cccurrence of | cr {: implies a 
ConuLtsonal—, case-, Or conformity-case-clause; the 


eccurrence of + inplies a conditional-, case-, conformity— 
case-, Or Closed-clause. Note that this algorithm assumes 
the acceptance cf modifications tc formal-parameter-packs 


and formal-declarers as outlined in Chapter ii and 


incorporated in the syntax of Appendix A of this thesis. The 
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new restrictions concerning formal-declarers imply that the 
algorithm should not usually have to look far to the right, 
even though technically speaking, the context to the right 
is still unkounded. The look-ahead algorithm will be 
attached to the syntax analyzer and lexical analyzer in such 


a way that lexical analysis will not be performed twice. 


Following the application of the look-ahead algorithm, 
a construct beginning with a left parenthesis and belonging 
to class (d) as defined above may be a closed-, ccllateral-, 
conditional-, casé-, or conformity-case-clause, Or an 
actual-declarer. Except for one special case a construct 
such as this is processed from left to right with further 
discernment occurring as mcre source text is analyzed. The 
remaining special case is the locally ambiguous construction 
{l1: expression) where m1 is an identifier and 
"expression" is a unit. Here the overloading of : causes 
problems. If : is the “label-symbol" then "11" JSeena stave 
identifier and the construction is a closed-ciause. If : is 
the “up-to-symtol" then the construction is part of an 
actual-declarer specifying a "row-of-" mode. Because of the 
definition-before-use restriction, if "11" is not defined 
then it may cnly be a label-identifier. However, a on & eos AES 
defined, then it may be either a label-identifier or a mode- 
identifier and the construct may correspondingly he a 


closed-clause or part of an actual-declarer. Note that the 
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case where "11" is’ .a\' jump has been excluded in the 
considered sublanguage. The identifier that cccurs between 
the open-symtol and the label- or up-to-symbol of this 
special case will ke called, for the fpurpcses of this 
discussion, the "MABEL" identifier ("11" in the above 
example). This special case will be detected by the look- 
ahead algorithm. The MABEL identifier will ke assumed to be 
a label-identifier unless it identifies the defining 
cccurrence of a mode-identifier, in which case it will be 
assumed to be an applied occurrence of this mode-identifier 
and an entry for the MABEL identifier will be made in a 
Special takbklezs#vate anyicappigeds®. occurrenceseofieas label= 
identifier, this special table will be inspected. If the 
table ccntains an entry that the applied occurrence of the 
label-identifier could identify as a defining occurrence, 
then the corresponding MABEL identifier will be assumed to 
be a label-identifier. The necessary modifications to tables 
will be made and kack-tracking will occur in that the code 
generated under the assumption that the MABEL identifier was 
a mode-identifier, and the construct was part of an actual- 
declarer, must be replaced by the code that precedes entry 
to a range. Finally when the corresponding right parenthesis 
is processed, the immediate right context will verify or 
contradict the assumpticn and the entry for the MABEL 
identifier in the special table may be deleted. In the case 


of contradiction, an error message will ensue. 
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The major disadvantage of using specific criteria, 
such as those mentioned above, during compilation is that 
errors occurring in a prcegram could be very misleading [40]. 
Lifficulties in error handling are inherent in the language 
since there exist tany syntactically Similar but 
semantically different constructions. The error handling 
strategy to be used in this implementation is as 
follows: fcllowing the detection of an error, the source 
program will be scanned tc the right until a symbol is 
found, which together with the left context already 
analyzed, delimits the section of program containing the 
error. The symbols which delimit error sections vary 
depending on the type of error. The matching of symbols, as 
reguired in the syntax of Appendix A, will be used in the 
search for error-delimiting symbols. No attempt will be made 
to correct errors occurring in a source program and no 
object code will be generated following the discovery of an 
error, though analysis will continue. Several techniques for 
reducing the number of consequential errors (that is, errors 
resulting frcem a previous error) are being investigated. 
Scme errors, for example, doubly-defined identifiers, do not 
cause protlems as far as delimiting is concerned but may 
cause many consequential errors. It would be an advantage to 
report at least the possibility that. .an. error is 


conseguential. 
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4.4 Table Structure 


The follcwing are the main tables of the proposed one- 
pass comrfiler. 
1. Lexical analysis table - used by the lexical analyzer to 
generate unique integers for such tokens as "“confrontation- 
tckens", "declaration-tokens", “syntactic-tokens", "sequen- 
Cing-tokens", and "hip-tckens", 
2. Indication and identifier table - contains actual 
sequences of characters used for identifiers and 
indications. 
3. Standard-prelude table - contains entries for operators, 
indications, frocedures, and some of the identifiers which 
are declared in the standard-prelude. 
4. Main takle - contains entries for each operator, 
indication, and identifier occurring within the program 
keing compiled. 
5. Declarer takle - contains entries for each declarer 


occurring in the program beirg compiled. 


Other tables will certainly exist within the compiler; 
for example, a table to assist in code generation will be 
necessary, but tables such as this will not te detailed in 
this investigation. The lexical analysis and _  standard- 
prelude tables are of fixed sizes. The cther three tables 
menticned abcve and the compiler-stack (used by the 


recursive descent routines) will vary in size depending on 
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the particular-program being compiled. If an overflow 
occurs, free storage will be redistributed so that the 
overflow condition disaprfears, if possible, according to an 


algorithm given in [41]. 


The main table will initially ke empty. Any 
declaration of an operator, indicaticn, or identifier will 
cause an entry to be made in this table. In order to reduce 
search time, entries will be made using hash addressing 
techniques with guadratic rehash to resolve ccllisions [ 23- 
Chapter 9]. The information contained in an entry in the 
main takle depends on the type of entry. [39-Chapter 5] 
gives a detailed description of the type of information that 
will ke stored in the main table. A scheme Sifilar to that 
described in [27] is used to represent the static block 
structure of a program in order that correct identification 
of identifiers, indications, and operators may be made. A 
modification is necessary however, in that ranges are not 
immediately recognizable to a one-pass compiler during a 
left-to-right scan even with the look-ahead algorithm as 
given in the previous section. In [42] "corrals" are defined 
as ccnstructs which have the same context as serial-clauses 
(including serial-clauses themselves) and it is proved that 
no errors will occur in the identification of indications, 
operators, or identifiers if all corrals except “formai- 


PARAMETERS" [R5.4.1.C] are considered as ranges. it is 
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possikle to distinguish formal-PARAMETERS frem other corrals 
by use of the look-ahead algorithm. Sc, for the purposes of 
identification cf operators, indications, and identifiers, 
the algorithm in [27] is used but with corrals, excluding 
formal-PARAMETEKS, replacing ranges. In the sequel, when 
"Corral" is mentioned, it is to be assumed that formal- 


FARAMETERS have kEeen excluded. 


When an applied occurrence of an operator, indication, 
or identifier is encountered, the main table will be 
searched for the ccrresponding defining occurrence. In the 
case that a defining occurrence is not found then the 
standard-prelude takle will be searched (also using hash- 
addressing). If the defining occurrence is found in this 
table then the entry will be transferred to the main table. 
In this way, only those operators, identifiers, and 
indications declared in the standard-prelude which are used 
in the particular-program being compiled will appear in the 
main table. If the defining cccurrence is not found in the 
standard-prelude table then for indications and operators an 
error message will be given; for identifiers a final 
possibility exists. A routine will be called to check 
whether the identifier is cne of the infinity of identifiers 
declared in the standard-prelude which, for practical 
reasons, is not yet stored in the standard-prelude table; 
for example, "long long long max int". If this check fails, 


then an error message will be given. 
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As noted in Chapter IIIT it is also necessary to record 
applied cccurrences of indications, identifiers and 
operators so as to ke able to report errors such as the one 
in the follcwing example: 

begin mode hippo = int; 
begin hifpo x; 
mode hippo = real; . 
At compile-time, whenever a new corral is enccuntered within 
the source prcgram, the compiler will reserve a flag for 
each indication and identifier having a defining occurrence 
which could be identified by an applied occurrence in the 
corral. These flags will be stored in the main table with 
their correspending indications and identifiers and will 
initially be set off (a flag may be "on" or “off"). When an 
applied occurrence of an indication or identifier is found 
to identify a defining occurrence, then the flag 
corresponding to that defining occurrence will be set on. 
When the compiler exits frcm a corral the set of flags for 
the corral being left will be “or"ed (in the Boolean sense) 
with the set of flags for the corral being re-entered. In 
this way a record will be kept of those indications and 
identifiers whose defining occurrences were identified by 
applied cccurrences at deeper levels of nesting. Upon 
encountering the declaration of an indication or identifier 


which is the same sequence cf marks as an already existing 
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and accessible defining cccurrence (declared within an outer 
cange), the flag for this existing defining occurrence in 
Eieacurnenittsetfotitlagsewplbubetginspéectedrkslif Hatvevis on; 
then any applied occurrence already processed would (and 
should) have identified this new defining occurrence and an 
error message will be given. If the inspected flag is off, 


then the processing of the declaration will kre completed. 


The above algorithm cannot be applied to operators 
because of the complex mechanism of operator identification 
and the acceptance of the double-identification context 
condition, with the consequent abolition of the loosely 
related condition. The following example should illustrate 
this: 


op rhino = (ref int i) void: skip; 


— — ——— — 


rhino i; 
op rhino = (real i) void: skip; 
op rhino = (int i) void: skip; . 


In a left-to-right scan of the above example, the formula 
thino i is found to contain an operator rhino which 
identifies the rhino declared in the outer range, and has an 
operand of reference-to-integral mode. Following this 
fcrmula are two operation-declarations. The first of these 
declares an operator whose representation consists of the 
same sequence of marks as an already applicable defining 


cecurrence but its operand is of real mode and hence this 
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declaraticn should not result in any error messages. The 
second declaration defines an operator whose representation 
also consists cf the same sequence of marks. However its 
Operand is of integral mcde and now troukle arises. The 
cperator of the formula rhino i should identify this 
defining occurrence since "i" can be dereferenced from 
reference-to-integral mcde to integral mode. With the 
removal otf the loosely related condition, it would now be 
impossible to find corresponding flags within the set of 
flags for operators, if these had been created. The solution 
involves the recording cf ail operator-applied occurrences 
and the mcdes of the operands before coercion. In the case 
that’ an “operand is a “balance"’ [R6.2. 1/6 °or°al collateral= 
clause, the mcdes of the components must be recorded. At the 
declaration cf an operator, these applied occurrences’ must 
re inspected sc as tc determine whether the operator- 
defining cccurrence in the new declaration could have heen 
identitied. This solution iS very expensive and in the 
proposed implementaion it will not be used. Rather, it is 
proposed to use a Similar algorithm to that described above 
for indications and identifiers. A flag will te created for 
each operator with a distinct representation, and for each 
operator whose representation is the same as that of some 
cther operators, but whose operand(s) are of different 
mcde(s) than thcse of the other operators. Thus an error 


message will be given in the case of the declaration of an 
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operator which is the same symbol as, and whose operand(s) 
are of the same mode(s) as the operand(s) of, an already 
existing applicable operator-defining occurrence which has 
been identified ky an operator-applied occurrence ata 
deeper level cf nesting. A warning message will be given in 
the case of the declaration cf an operator which is the same 
symbol as an already existing applicable operator-defining 
occurrence which has been identified by an oferator-applied 
occurrence at a deeper level of nesting. In the above 
example, the two inner operaticn-declaraticns would thus 
produce warning messages. This method is chosen because it 
is much less expensive than the complete solution even 
though, technically speaking, the proposed compiler is not 
now a compiler for a suklanguage of ALGOL 68. It will he 
possitle to write programs in the suklanguage, the 
elaboration of which will be different than that specified 
by the Report. For the chosen algorithm, the maximum number 


flags necessary is equal to the maximum number of 


rh 


fe) 
indications, identifiers, and operators times the maximum 


depth of nesting of corrals. 


The task of representing mcdes in a suitable manner at 
compile-time is not trivial, because the set of modes is 
infinite and modes may te defined recursively. The declarer 
table is ccnstructed in such a way as to enable the storing 


and efficient manipulation of modes [29, 39-Chapter 5, RS, 
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44, 45}. When processing the declarer of a mode-declaration 
the indication of which ultimately identifies (see Chapter 
ttt) an indication-applied occurrence whose indication- 
defining occurrence has not yet been encountered, then this 
declarer must te "remembered" so that when the missing 
indicaticn-defining occurrence is encountered, processing of 


the declarer may be continued. 


The algorithm for determining the "equivalence" of 
modes given in [44] is not immediatly applicaktle in a one- 
pass ccmpiler since this algorithm efficiently tests a set 
of modes already given, whereas in a one-pass compiler, 
declarers must ke processed as they are encountered, rather 
than all at once. Cne alternative is to use an algorithm 
Similar tc those given in [29, 46] for the pair-wise testing 
of equivalence. However, another alternative to be 
considered arises from [47], where a method ci ordering 
modes is outlined. An algorithm based on this method would 
order modes as the corresponding declarers were entered in 
the declarer takle and equivalence could Le checked at the 
same time. This ordering of modes has other advantages that 


will prove Feneficial during compilation [47]. 


4.5 Coercions and Identification of Operators 


ee ee ee a ee ee ⸗ei See 


The prcklem of determining coercion steps is not 


unigue to ALGCL 68 but it is much more complex than in other 
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high-level languages because there are more coercions and 


because there is a potential infinity of modes in ALGOL 68. 


The task of operator identification is also included 
in this section kecause of the rele of coercions in this 
process. Initially the identification of operators proceeds 
analogously tcrthelidentifiatation of Seindications ?*showevery, 
in ALGCL 68, the mode(s) of the operand(s) of an operator 
are also considered in the identification process. The 
mode({s) of the operand(s) of an applied occurrence of an 
Operator ("a priori" modes) must be "firmly coercible" 
fR4.4.3.a} to the mode(s) of the formal-parameter(s) given 
in the ccrresponding operation-declaration ("a posteriori" 
modes). Operator identification is also complicated by 
thalancing" and by collateral-clauses. In the general 
problem there is a set (perhaps infinite) of a posteriori 
modes and a set of a pricri modes. A unique sequence of 
ccercions must te found from a mode in the a friori set to a 
mode in the a pesteriori set such that constraints imposed 


by ccntext are satisfied. 


The processes of operator identification and 
deterninaticn of coercions form a major part of an ALGOL 68 
compiler and the problems involved for a one-pass compiler 
seem to be nc more complex than those involved for a multi- 
pass compiler. An investigation of implementation techniques 


for these processes within the proposed one-pass compiler 
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has not yet been completed. Discussion of the processes and 
algorithms for scme of the tasks involved may be found in 


Mnl2, k29=Chaptern,, CCUZn4ASA 49 pe3%0:). 


MOLre Ox less standard techniques exist for 
synthesizing okject code once tke requisite information has 
been gathered during the analysis phase of ccmpilation. 
Several problems remain for the one-pass compiier and these 


will be discussed in this section. 


The proposed one-pass compiler will generate object 
code directly, locating it within the runtime environment so 
that when compilation has finished, execution may tegin 
immediately. [39+Chapter 11] contains a detailed description 
of code generaticn for an ALGOL 68 compiler. Because of the 
availability of compile-time tables at runtime, it is 
possitie that scme of the runtime routines might be 
interpretive; for example, routines for the elakoration of 
declarers. Transput routines will be based on algorithms 
presented in [51] and thus format-denotations wiil be 
ccmpiled as stggested in that reference. Only very local 


optimizations will be made during code generation. 


Several runtime cpticns will be available; for 
example, initialization-before-use-checking, kound-checking, 


state-checking, and scope-checking as well as diagnostic and 
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debugging aids. These checks are provided as cptions because 
a ccrrect proper particular-program in the suklanguage will 
not require them. Cf these checks, scope-checking warrants 
the most attention and a discussion of scope-checking for a 
multi-pass ccmpiler may be found in [39-Chapter 8], the 
problem not having been investigated as yet for this one- 


pass implementation. 


One of the problems that a cne-pass compiler must 
solve is that of code generaticn for jumps. Often, when an 
applied occurrence of a label-identifier is frocessed, the 
defining occurrence which it identifies will not yet have 
been encountered, or it may have been encountered but in an 
outer range, in which case the compiler cannot ke certain 
whether or not it really is the defining occurrence. 
Therefore, special action using indirect kranching is 
scmetimes necessary. If the defining occurrence of the 
constituent lakel-identifier of a jump has already been 
encountered within the same range as the jump, then code for 
the jump may ke generated directly. Otherwise, a branch 
instruction which transfers control to some reserved 
locaticn in the "coffin" (descriked in section seven of this 
Chapter), will ke generated, and an entry for the applied 
occurrence of the label-identifier within the jump will be 
created in the main table. This entry will ke flagged as not 


having been ccmpleted and will contain a pointer to the 
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reserved location in the ccffin and a pointer to a defining 
occurrence (in an outer range) it might identify, if one 
exists. When the corresponding defining occurrence is 
encountered or verified (by re-entering the range containing 
the defining cccurrence that the applied occurrence might 
identify) then the transfer point is known and a Eranch 
ainstructicn to this transfer point will be inserted in the 
reserved location, thus completing the indirect Eranch. Many 
Slight mcdifications of this approach exist, depending on 
the instruction set of the physical machine fcr which the 


compiler is kteing implemented. 


Another problem peculiar to a one-pass compiler is the 
generaticn of code for coercions. In general, when a 
coercend is encountered it is not known what coercions, if 
any, are to tke performed until further source text is 
processed. The proceduring coercion affects whether or not a 
coercend is elaborated and thus provision must be made for 
the situation when a given coercend is not elakorated, but 
procedured. All cther coercions foliow elaboration of the 
coercend, and provision must be made for inserting code — 
these or at least branching to the code. Instead of 
detailing the method of solution, three examples will be 
given, illustrating the problems and the techniques used to 
sclve them. 


Example 1: 


This example illustrates why coercions are not, in general, 
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found until it is too late to insert the code for then 
directly in place in the object program. Consider a opr b, 
where “a" and "b" represent any coercends, and Opr 
representS any operator. First the coercend "a" will be 
processed and an a priori mode (or modes) obtained. The 
Operator opr will then be scanned but, in general, it will 
not yet be known what operator-defining occurrence this 
Operator-applied cccurrence identifies. Thus the coercend 
"b" must ke processed in crder to obtain the a pricri mode 
(OL modes) cr the right operand so that operator 
identification may be completed. Once the identification is 
complete then and only then will the coercions for "a" and 
"E" be known. 
Example 2; 
Consider a coercend in a context such that dereferencing and 
then widening must occur. Let "a" denote the coercend and 
"code(a)" the code generated for the elaboration of "a", 
Then the following sequence of "pnseudo-instructions" 
represents the flow of the code that will be generated. 

43 Sumtato 2. 

2: Code(a). 

3: dJumpito 6. 

4; Perhaps other code (generated while the compiler 

discovers the coercions required for "a"). 
Se Jump to ¢. 


6: Lereference (2) (ta represents the result of 
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elakorating code(a)). 

7: Widen (6). 

S21cJunp toes 

DOCH 
The transfer points for jumps and the locations from which 
jumps are made may be remembered by employing stack 
techniques. 
Example 3: 
Consider a coercend in a context such that widening, 
proceduring, and rowing must occur. "a" and "“code(a)" are as 
in example 2. The following sequence of pseudo-instructions 
represents the flow of the code that will be generated. 

1: Jumeptoanho. 

PoarcG@aode (a)r. 

2: Tiunpebe tc. 

4: Perhaps cther code. 

5° Tdump "coats 

6: Widen(2). 

Teagkpileg: 

#) Thecldge 

Snerdunpnto 2. 

10: koutine Lruilder (8). 

Tie Row(10). 

Teer WO Eto 4. 

lars eos 


"Prolog" and “epilog" represent standard code generated for 
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procedure entry and exit. "Routine builder" takes the 
location of the procedured coercend ("8" in the above) and 
builds a representation for the procedure (envircnment 


informaticn must be added). 


Little detail will be given in this discussion of the 
runtime environment, again because investigation is 
incomplete and because there exist methods proposed for 
multi-pass compilers that will, with little modification, 


suffice for the proposed one-pass compiler. 


When an AIGOL 68 object program is running, three 
separate parts of storage may be distinguished. 
1. The "coffin" contains object-code and constant data for 
the frogram. 
2. The "stack", on which is allocated the space for most 
locally declared ckjects, for intermediate results, and for 
organizational entities. 
a4 The "heap", on which space is allocated for ail global 
generaticns, and for local objects whose storage 


requirements may vary during elaboration. 


The stack and the heap will occupy opposite ends of an 
area of storage and grow towards each other as space is 


allocated on each of them. The recovery of space from the 


,4 “— 





























ne 

edt eexst “sebliud oatauod™ te 
may 

bas {evods ed? al "8". basonse> Sexubae o1q 
jneanozivas) savbeso7g eit got aoists ” 


— —⸗ * 


i 


ar oe ee 


ett to aolespo226 etds at aevey od iftw fisteb of: 
al noiseoiteavad — aise , fwemnoz eve 
101 beaezorq etodron deine saedi sausoaed — 
,1oidsoitebom —— djtw ,ifiw sed azsLkquod am 

-tefiqmo> agsq-sno bexoqozy vi 


* rer: 


eeidt ,poiaawa et mpaporq toetdo da 108th. nae bande 
7 24 3 : es 


-bodeinpaiterb od ysm spsrotea ** ⸗⸗aea 12s9os 


aut 


104 s¢sb tastemoo bas eboo-soetdo eaistaop vette ‘od? X 
2 a 


a ts _ ; —* 

teoa 120% sosqe ei doidw ao —E 
vot bas ,etivaez etsihemtotsi — baas tost 
api staged J 

Sedolp Lie 120% betsoof{is ei s0Bqe doinw ao 
epsi0se eeodw atoestde Isool 102 has i 
.coltnitodsls patrub: tier y6e adaon 

as to abas seteogqo yquooe Liiw — ont. bas oe 


id 


al sosqae 26 isitoe dose ebaswos woxR — ee 


* al dal 


Wor 
eft mort soeqe to oop T .@ o dose ao be 
aoe ote 
ae 1D i eo”7_. 





i 
ia 


78 


stack will ke achieved using stack techniques similar to 
those of ALGOL 60 implementation [23-Chapter 8, 25]. Space 
will te recovered from the heap whenever a collision between 
the stack and heap is imminent. The process of recovering 
Space from the heap is known as garbage-collection and 
techniques for this are described in detail in [27, 39- 


Chapter 10, 52 |< 


The stack will be organized procedure-wise, that is, 
theststacka willteconsist sofeblocks ofrstorage calledsstack 
cells, of which is the storage local tc a particular 
invocaticn of a routine. |{39-Chapter 10, 53] contain further 
details of runtime environment including the representation 


of values and detailed stack organization. 


The runtime environment will include facilities for 
runtime debucging and post-mortem dumps. if requested, a 
modified copy of the source program will be retained during 
elaboration cf the object program. This copy of the source 
program may ke compacted scnewhat because the compiler 
tables lee also be retained, remembering that the 
implementation is a compile-and-go system. Traces, statement 
counts, and post-mortem dumps will then be made available to 


the user in a similar fashion to that described in [54]. 
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CHAPTER V 


CONCLUSICN 


The objectives of this investigation were twofold. 
Firstly, a syntax for a modified versicn cf ALGCL 68 was 
developed. This syntax, together with a corresponding 
ccntext-free syntax and a representation language, appear as 
Appendices to this thesis. The modifications made to the 
language descrikted in the Report, consist of additions and 
alterations considered to enhance the language from a 
user's, as well as an implementor's point of view. Included 
among the modifications were some of the many suggestions 
for additions and alterations proposed since the publishing 
cf the Report, as well as some changes arising out of this 
investigation. The modifications and reasons fcr their 
incorporation are discussed in Chapter II of this thesis. 
Some suggestions for semantic changes are included but 
emphasis is generally on syntactic rather than semantic 


details. 


The seccnd cbjective of this thesis was the design of 


a cne-pass ccempiler for ALGOL Go « Implementation 
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dependencies and restrictions imposed to facilitate the one- 
pass implementaticn are discussed in Chapter III. Chapter IV 
describes techniques for some of the tasks which the 
compiler must perform. Experience gained while designing the 
compiler was of considerable importance in the consideration 


cf modifications to the language. 


ALGCL 68 is a powerful programming language 
incorporating and generalizing many concepts frcem other 
programming languages and from current literature. As well 
as this, many new features are introduced and these give 
rise tc difficulties hitherto unencountered in the 
implementation of programming languages. Implementation of 
ALGOL 68 is thus more difficult and more expensive than, for 
example, the implementation of ALGOL 60. Chapter IV of this 
thesis is by no means complete in its description of the 
proposed cne-pass ccmpiler and there remain many problems on 
which further research must be carried out. These problems 
however, are not just peculiar to one-pass compilers but to 
multi-pass compilers as well. At this stage it would seem 
that a one-pass compiler for the given suklanguage of ALGOL 
68 may ke ccnstructed, provided that a multi-pass compiler 
for the unrestricted language is possible. The quality of 
object code produced by this one-pass compiler will be poor 
when compared with that code which could be produced by a 


multi-pass compiler. This is because of the extra 


os al 
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information available to the multi-pass compiler when 


generating okject code. 


It is proposed to implement a one-pass compiler based 
on the design presented in this thesis. The compiler will be 
written in a low-level language of a particular physical 
Machine. A lcwxlevel language is chosen so that efficiency 
of the compiler is improved, despite the increases in 
programming and debugging time. There are two proposed 
usages for (thetechpilecsp Firstly, it “wilik ~ be “used, in an 
educational environment providing fast turn-around for 
normal, small-to-medium size programs, together with good 
debugging and diagnostic facilities. Secondly, the compiler 
will ke used to experiment with the inclusion and exclusion 
of various features of ALGOL 68, thus giving some idea of 
the cost, with respect to both efficiency and size, of 
including certain of these features in ALGOL 68 and its 


sublanguages. 
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APPENDIX A 


A MODIFIED ALGOL 68 SYNTAX 


This syntax is presented in the same order as that of 
the Report, with ccrresponding section numbers and headings. 
Vertical lines in the left margin denote those parts of the 
syntax that are additions or alterations. Omissions in the 
ordering sequence corresfond to productions present in the 


Report but not included in this syntax. 


1.2.1. Metaproduction Rules of Modes 


a) MODE: MCOD; UNITED. 
b) MOOD: TYPE; STOWED. 
€) TYEE: ELAIN; format; PROCEDURE; reference to MCDE. 
d) PLAIN: INTREAL; boclean; character. 
e) INIREAL: INTEGRAL; REAL. 
{ £) INTEGRAL: SHCNGSETY integral. 
| g) REAL: SHCNGSETY real. 
| ha) SHONGSETY: long LONGSETY; short SHORTSETY; EMPTY. 
| hb) LCNGSETY: long LONGSETY; EMPTY. 
{ hc) SHORTSETY: short SHORTSETY; EMPTY. 
{hd) SHCNG: short; long. 
i) — 
3) PROCEDURE: procedure PARAMETY MOID. 
k) PARAMETY: with PARAMETERS; EMPTY. 
¥) PARAMETERS: PARAMETER; PARAMETERS and PARAMETER. 
ma) PARAMETER: MCDE parameter. 
| mb) PRAMETERS: parameter and MODE PRAMETERS; parameter. 
n) MOID: MCDE; void. 
{ o) SfCWED: structured with FIELDS; ROWS of MODE. 
p) FIELDS: FIFLD; FIELDS and FIELD. 
ga) FIELD: MODE field TAG. 
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FOLDS: field TAG and FIELDS; fieid TAG. 

ORGS SLELIERs TAG OLETIER; TAG DIGIT. 

LETIER: letter ALPHA; letter aleph. 

BEBHAYPaheh fecatdsacsCLatggen: 13.3; Kk; 13 m3 ns 03 Ps 
QFEESMSLOCSLUpeN SOs XS YY; 02s 

DIGIT: digit FIGURE. 

FIGURE: zero; one; two; three; four; five; six; seven; 
€ight; nine. 

UNITED: unicn of LMOODS MOOD mode. 

LMOODS: LMOOD; LMOCDS LMOOT. 

LMCCC: MOOD and. 


2. Metaproduction Rules Associated with Modes 


ROWS: row; RCWS row. 

ROW: row; row of. 

ROWSHETY s FRGWS; REM PLY. 

NONROW: NCNSTOWED; structured with FIELDS. 

NCNSTOWED: LYPEs sUNZTED. 

REFETY: reference to; EMPTY. 

NCNEROC: FLAIN; format; reference to NCNFRCC; 
procedure with FARAMETERS MOID; UNITED; 
structured with FIELDS; row of MODE. 

PRAM: procedure with RMODE parameter MOID; 
procedure with LMODE parameter and KMCDE parameter 
MOID: 

LMCDE:)MOCDE. 

RMCDE: MODE. 

LMCCT: MCCD and. 

LMOODSETY: MOOD and LMCODSETY; EMPTY. 

RMOCDSETY: RMOODSETY and MOOD; EMPTY. 

LOSETY: LMOODSETY. 

BOX: LMCCDSETY box. 

LFIELDSETY: «cFLELDS<and; EMFTY. 

RFIEFLDSETY: and FIELDS; EMPTY. 

CCMELEX: structured with real field letter r letter e 
and real field letter i letter m. 

BITS: structured with row cf boolean field SHCNGTHETY 
letter aleph. 

SHCNGTHETY: LENGTH LENGTHETY; SHORTH SHCRTHETY; EMPTY. 

LENGTHETY: LENGTH LENGHTHETY; EMPTY. 

SHORTHETY: SHORTH SHORTIHETY; EMPTY. 

LENGTH: letter 1 letter o letter n letter g. 

SHORTH: letter s letter h letter o letter r letter t. 

BYTES: structured with row of character fieid 
SHCNGTHETY letter aleph. 

STRING: row of character; character. 

MABEL: MODE mode; label. 
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1.2.3. Metaproduction Rules Associated with Phrases and 
Coercion 
a) PHRASE: declaration; CLAUSE. 
b) CLAUSE: “MOCID clause. 
} ©) SOMEteserial;s tnitary; CLOSED; choice; chooser; in; 
Out; repetitive; FROBYT; while; do. 
{ d) CLCSED: closed; collateral; CONFASE. 
{ £) |.CCNFASE:)> conditional; case; conformity case. 
g) SORTS Strong; FEAT. 
} h) FEAT: firm; meek; weak; soft. 
3) STIRMGLstrongsifirns: 
k) ALAFPTED: ADJUSTED; widened; rowed; hipped; voided. 
1) ADJUSTED: FITTED; procedured; united. 
m) FITTED: dereferenced; deprocedured. 
| n) CCNFETY: conformity; EMPTY. 
| 0) FRCBY Ttetrongaby yetos 
1.2.4. Metaproduction Kkules Associated with Coercends 
a) COERCEND: MOID FORM. 
b) FORM: confrontation; FCRESE. 
{| c) FORESE: PRICKETY ALIC formula; cohesion; base; 
routine denotation. 
fd) ADIC<e dyadic; \monadic. 
ea) PRICRITY: priority NUMBER. 
{eb) PRIORITIES: PRIORITY; priority NINE plus one. 
| ec) PRIGRETYS, PRIORITIES; EMETY. 
ti) NUMBER: one; TWO; THREE; FOUR; FIVE; SIX; SEVEN; 
EIGHT; NINE. 
gq) IWC: one plus one. 
Hh) THREES@IWOSrpius ones 
i) FOUR: THREE plus one. 
j) FIVE: “FCUR plus) one. 
k) SIXSOFLVE,plusvone: 
1) SEVEN: SIX plus one. 
m) EIGHT: SEVEN plus one. 
n) NINE: EIGHT pilus one. 
1.2.5. Cther Metaproduction Rules 
a) VICTAL: VIRACT; formal. 
Ea) VIRACT: virtual; actual. 
tbh) © VIRMAL: virtual; “formal. 
c) LOWPER: lower; upper. 
d) ANY: KIND; suppressible KIND; replicatakle KIND; 
replicatable suppressible KIND. 
e) KIND: sign; zero; digit; point; exponent; complex; 


string; character. 
NOTICN: ALEHA; NOTICN ALPHA. 
SEPARATOR: LIST separator; go cn tokyl; completer ; 
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sequencer. 

LIST: list; sequence. 

PACK: pack; package. 

BRACKET: bracket; packet; pack. 
RALIX: two; four; eight; sixteen. 
MATCH: strop; brief. 


The Computer and the Frogram 


program: open symbol, standard prelude, likrary 
Frelude option, particular program, exit, library 
postlude option, standard postlude, close symbol. 

standard prelude: declaraticn prelude sequence. 

liktrary prelude: declaration prelude sequence. 

particular program: strong serial void clause PACK, 
comment sequence option. 

exit: go on symbol, letter e letter x letter i letter 
t, lakel symbol. 

library fostlude: statement interlude. 

standard postlude: strong void clause train. 


1. General Constructions 


NOTION cpticn: NOTION; EMPTY. 
chain of NCTIONS separated by SEPARATORS: NOTICN; 
NOTICN, SEPARATOR, chain of NOTIONS separated by 
SEPARATCEs. 
NOTICN LIST: 
chain of NOTIONS separated by LIST separators. 
list separator: comma tokyl. 
sequence separator: EMPTY. 
NOTEIGNHLISTkpropens 
NOTICN, LIST separator, NOTION LIST. 
NOTION pack: open tokyl, NOTION, close tokyl. 
NOTION package: begin tokyl, NOTION, end tokyl. 
NOTICN Eracket: sub tokyl, NOTION, bus tokyl. 
NOTION packet: 
alternate sub tokyl, NOTION, alternate bus tokyl. 
NOTION tokyl: comment sequence option, NCTICN symbol. 


2. Letter Ickens 
LEFITER: LETTER tokyl. 
3. Digit»Tckens 


digit token: DIGIT. 
DIGIT: DIGIT tckyl. 
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3.0.9. Comments 


b) comment: MATCH comment tokyl, comment item sequence 
cpticn, MATCH comment symbol. 
c) comment item: character token; cther comment item. 
d) character token: LETTER symbol; DIGIT symbol; 
point symbol; times ten to the power symbol; 
open symbol; close symbol; comma symbol; 
space symbol; plus symbol; minus symkol. 


GS lea be a2dentifiers 


a)* identifier: MABFL identifier. 

D) MABEL identifier: TAG. 

CG) TAGIVETIER: TAG, LETTER, 

d) TAGEDAGCIISaTAaG, pDicisz. 

e)* ranges program; SORT serial’ CLAUSES; 
Erocedure PARAMETY MOID routine denotation; 
SOnLe MOLD conformity unit; ———— 


uy 2s nandncathcns 


a) * indication: 
MODE mode indication; PRIORETY ADIC indication. 
b) MODE mode indication: 
mode standard; comment sequence option, indicant. 
c) mode standard: koolean tokyl; character tokyl; 
formatetokylsastringatokyl:; sema tokyl; file tokyl; 
SHCNG tokyl sequence option, integral tokyl; 
SHONG tokyl sequence option, real tokyl; 
SHONG tokyl sequence option, complex tokyl; 
SHCNG tokyl seguence option, bits tokyl; 
SHCNG tokyl sequence option, bytes tokyl. 
d) PRIORITIES ADIC indication: ADIC indication. 
e) ALIC indication: SHONG tokyl sequence cption, comment 
sequence cption, ADIC indicant. 
g)* adic indication: PRIORETY ALIC indication. 


Me ouiis GPrerators 
a)* operator: FRAM PRIORETY ADIC operator. 


b) PRAM PRIORETY ADIC operator: PRIORETY ADIC indication. 
d)* ADIC operator: PRAM PRIORETY ALIC operator. 


5.0.1. Denotaticns 


a) * denotation: PLAIN denotation; BITS denotation; 
row of character denotation; format denotation; 
procedure PARAMETY MOID denotation. 
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ae 1. CRT SCR Lain Detiotations 


a)* plain denotation: PLAIN denotation. 
{ b) SHONG INTREAL denotation: 
SHONG tokyl, INTREAL denotation. 


5.1.21.1. Integral Denotations 
a) integral denctation: digit token sequence. 
5.1.2.1. Real Denotaticns 


a) real denotation: 
variakle point numeral; floating point numeral. 
b) variable point numeral: 
integral part option, fractional fart. 
G) integral part: integral denotation. 
| 4d) fractional part: point tokyl, integral denotation. 
e) flcating point numeral: stagnant part, exponent part. 
£f) stagnant part: 
integral denotation; variable point numeral. 
9) expenent part: 
times ten to the power choice, power of ten. 
{ h) times ten to the power choice: 
times ten to the power tokyl; letter e. 
i) power of ten: plusminus option, integral denotation. 
| 3) plusminus: plus tokyl, minus tokyl. 


5.1.3.1. Boclean Denotations 
j} a) koolean denotation: true tokyl; false tokyl. 
5.1.4.1. Character Denotations 


| a) character denotation: 
MATCH quote tokyl, string item, MATCH quote symbol. 
b) string item: 
character token; quote image; other string item. 
| c) quote image: MATCH quote symbol, MATCH quote symbol. 


5.2.1. Bits Denotations 


a)* bits denotation: BITS denotation. 

{ba) structured with row of boolean field LENGTH LENGTHETY 
letter aleph denotation: 
long tckyl, structured with row of boolean field 
LENGIBETY letter aleph denotation. 

Jbb) structured with row of boolean field SHCRTH SHORTHETY 
letter aleph denotation: 
shert tckyl, structured with row of koolean field 
SECRTHETY letter aleph denotation. 
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structured with row of boolean field letter aleph 
denotation: 
RADIX radix, letter r, RADIX byte sequence. 

LWOLTaARixcnhdigitatweo, 

EOUTETadixsldigitefour. 

S1ght®radix: digditgei gher 

Sixteenttadix: digit one, digit six. 

byte: RADIX byte. 

two byte: digit zero; digit one. 

fouri byte: twowbytesedigdti two; digit three. 

eight byte: four byte; digit four; digit five; 
digit six; digit seven. 

Sixteen Lyte: 
eight byte; digit eight; digit nine; letter a; 
letter b; letter c; letter d; letter e; letter f. 


1. String CDenotations 


string denotation: row of character denotation. 
row of character denotation: MATCH quote tokyl, string 
item sequence proper option, MATCH guote symbol. 


1. Koutine Lenotations 


routine denotation: 
procedure PARAMETY MOID routine denctation. 

procedure MOID routine denotation: virtual MCID 
declarer, routine tckyl, strong MOID unit. 

procedure with FARAMETERS MOID routine denotation: 
formal EFARAMETEKS pack, virtual MOID declarer, 
routine tokyl, strong MOID unit. 

VIRACT FARAMETERS and FARAMETER: 
VIRACT PARAMETERS, comma tokyl, VIRACT FARAMETER. 

formal MODE PRAMETERS: formal MODE declarer, formal 
MCDE FRAMETERS definiticn. 

fortal MODE parameter and MODE PRAMETERS definition: 
MODE mode identifier, ccmma tckyl, formal MODE 
PRAMETEKS definition. 

formal MODE parameter definition: 
MODE mode identifier. 

formal MODE parameter and RMODE PRAMETERS definition: 
MODE mode identifier, comma tokyl, formal RMODE 
PRAMETEFS. 

VICIAL parameters pack: VICTAL PARAMETERS pack. 


1. Format Denotations 


format denotation: 
formatter tokyl, collection list, formatter tokyl. 
collection: picture; insertion option, replicator, 
collection list pack, insertion option. 
picture: MODE pattern option, insertion option. 
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inserticn: literal opticn, insert sequence; literal. 
IMSert. replicator, alignment, literal option. 
replicator: replication option. 
replication: dynamic replication; integral denotation. 
dynamic replication: 
letter n, strong CLiCSED integral clause. 
alignment: 
Lettenrakselettectxttlettereys.letter 1; Letter p. 
literal: replicator, SIRING denotation, replicated 
literal sequence option. 
replicated literal: replication, STRING denotation. 
sign mould: loose replicatable zero frame, sign frame; 
loose sign frame. 
loose ANY frame: insertion option, ANY frame. 
replicatable ANY frame: replicator, ANY frame. 
zere frame: letter Zz. 
Sign frame: plusminus. 
suppressible ANY frame: letter s option, ANY frame. 
frame: ANY frame. 


2. Syntax of Integral Fatterns 


integral pattern: sign mould option, integral mould; 
integral choice pattern. 

integral mould: loose replicatable suppressibie digit 
frame sequence. 

digit frame: zero frame; letter d. 

integral choice pattern: 
inserticn option, letter c, literal list pack. 


B. Syntaxeofiefkeal Patterns 


real pattern: sign mould option, real mould; 
floating pcint mould. 
real mould: integral mould, loose suppressible point 
frame, integral mould option; 
locse suppressible point frame, integral mould. 
point frame: point tokyl. 
flcating pcint mould: 
stagnant mould, loose suppressible exponent frame, 
sign mould option, integral mould. 
stagnant mould: sign mould option, INTREAL mould. 
exponent frame: letter e. 


4. Syntax of Boolean Patterns 


boolean pattern: insertion option, letter bt, boolean 
choice mould opticn. 

boclean choice mould: open tokyl, literal, comma 
tokyl, “literal, close tokyl. 
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59.5.5. Syntax of Character Patterns 


a) character pattern: loose suppressible character frame. 
b) character frame: letter a. 


5. 5. 6. Syntax of Complex Patterns 


a)* complex pattern: COMPLEX pattern. 

b) COMPLEX pattern: real pattern, loose suprressible 
complex frame, real pattern. 

c) complex frame: letter i. 


5.9./. Syntax ofpString Patterns 


a)* strangl pattern: rowhofecharacter pacternh: 

b) row of character pattern: loose string frame; 
loose replicatable suppressiftle character frame 
sequence proper; 
inserticn option, replication, supfressible 
Character frame. 

¢) Stringo®itame:* letter ts 


5.5.7A. Syntax of Bits Patterns 


a)* bits pattern: structured with row of koolean fieid 
letter aleph pattern. 

b) structured’ with row of boolean field letter aleph 
pattern: radix mould, integral mould. 

c) radix mould: insertion option, RADIX radix, letter rc. 


5.5.8. Iransformats 


a) structured* with! row) of character' field letter aleph 
digit one transformat: firm format unit. 


6.0.1. Phrases 


a) * SCME phrase: SORT SOME PHRASE. 

b) * SCME expression: SORT SOME MODE clause. 

c)* SCME statement: strong SOME void clause. 

d)* MODE constant: MODE FORM. 

e)* MODE variakle: reference to MODE FORM. 

f) * procedure: REFETY FROCEDURE FORM. 

g)* structure display: STIRM collateral structured with 
FIFLDS and FIELD clause. 

h)* row display: STIRM collateral ROW MODE clause. 
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1. Serial Clauses 


SORT serial CLAUSE: declaration prelude sequence 
Option, Suites or esorRT CLAUSE trains. 

declaration prelude: statement prelude cption, single 
declaration, go on tokyl. 

statement prelude: chain of strong void units 
separated by go cn tokyls, go on tokyl. 

single declaration: 
unitary declaration; collateral declaration. 

SORT MOID unit: SORT unitary MOID clause. 

Suite of strong CLAUSE trains: chain of strong CLAUSE 
trains separated by completers. 

sunte rot tPEATSCLAUSERtraines PEAT PCLAUSE strain; 
FEAT CLAUSE train, completer, suite of strong 
CLAUSE trains; 
strong ‘CLAUSE train, completer,; suite of FEAT 
CLAUSE trains. 

SORT MOID clause train: label sequence option, 
statement interlude option, SORT MOIL unit. 

statement interlude: chain of strong void units 
separated by sequencers, sequencer. 

sequencer: go on tokyl, label sequence option. 

label: label identifier, label tokyl. 

completer: completion tokyl, label. 


1. Collateral Phrases 


collateral declaration: 
unitary declaration list proper. 

strong collateral void clause: parallel symbol option, 
strong void unit list proper PACK. 

STIRM collateral ROW MODE clause: 

STIRM MCDE balance PACK. 

strong MCID CONFETY balance: 
strong MOID CONFETY unit list proper. 

FEAL MOID CONFETY balance: FEAT MOID CCNFETY unit, 
comma tckyl, strong MOID CONFETY unit list; 
etrong MCID CONFETY NnVt, comma tokvl, “BEAT MOLD 
CONFETY unit; 
strong MOID CONFETY unit, comma tokyl, FEAT MOID 
CCNFETY Ealance. 

STIRM ccllateral structured with FIELDS and FIELD 
structure clause: STIRM structured with FIELDS and 
FIFLD structure PACK. 

STIRM structured with FIELDS and FIELD structure: 
STIRM structured with FIELDS structure, comma 
tokyl, STIKM structured with FIELD structure. 

STIRM stuctured with MODE field TAG structure: 
STIRM MCDE unit. 
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1. Clo&Sedi Clauses 
SORTAGlOSEed CLAUSES RSORDEServalZGLAUSE PACK. 
1. Conditional Clauses 


SORT CCNFASE CLAUSE; 

MATCH CCNFASE begin tokyl, SORT MATCH CCNFASE 
chooser CLAUSE, MATCH CONFASE end tokyl. 

SORT MATCH CCNFASE chooser CLAUSE: 

CONFASE, SORT MATCH CONFASE choice CIAUSE. 
conditicnal: strong serial boolean clause. 

case: strong serial integral clause. 

conformity case: meek serial UNITED clause. 

SORT MATCH conditional choice CLAUSE: MATCH thef 
tokyl, SCKI MATCH conditional chooser CLAUSE. 

strong MATCH CONFASE choice CLAUSE: 
StLOnNGSHATICH CONFASE*®in CLAUSE, "Strong? MATCH 
CONFASE out CLAUSE option. 

FEAT MATCH CONFASE choice CLAUSE: FEAT MATCH CONFASE 
in CLAUSE, strong MATCH CONFASE out CLAUSE option; 
strong MATCH CONFASE in CLAUSE, FEAT MATCH CONFASE 
out CLAUSE. 

SORT MAICH conditional in CLAUSE: 

MATCH conditional in tokyl, SORT serial CLAUSE. 

SORT MAICH case in MOID clause: 

MATCH case in tokyl, SORT MOID balance. 

SORT MATCH conformity case in MOID clause: 

MATCH conformity case in tokyl, SORT MOTD 
CONLOERITY unit; 

MATCH conformity case in tokyl, SORT MOID 
conformity balance. 

SORT MATCH CONFASE out CLAUSE: 

MATCH CONFASE out tokyl, SORT serial CLAUSE; 
MATCH CCNFASE out begin tokyl, SORT MATCH CCNFASE 
chooser CLAUSE. 

SORT MOID conformity unit: 

MODE specification, SORT MOID unit. 

MOLE specification: 
cpen tokyl, formal MODE declarer, MOLE mode 
identifier option, close tokyl, choice tokyi. 


1. LTeclarers 


declarer: VICTAL MCDE declarer. 
VICTAL MODE declarer: 
VICTAL MODE declarator; MODE mode indication. 
VICTAL structured with FIELDS declarator: 
structure tokyl, VICTAL FIELDS declarator pack. 
VICTAL MODE field TAG and MODE FOLDS continuation: 
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MODE field TAG selector, comma tokyl, VICTAL MODE 
FOLDS continuation. 

VICTAL MODE field TAG continuation: 
MODE field TAG selector. 

VICTAL MODE field TAG and RMODE FOLDS continuation: 
MODE field TAG selector, comma tokyl, VICTAL RMODE 
FCIDS declarator. 

field declarator: VICTAL FIELD declarator. 

VICTAL STCWED FOLDS declarator: VICTAL STOWED 
declarer, VICTAL STOWED FOLDS continuation. 

field selector: FIELD selector. 

MODE field TAG selector: TAG. 

VICTAL NCNSTOWED FOLDS declarator: virtual NONSTOWED 
declarer, VICTAL NCNSTOWED FOLDS continuation. 

VIRACT reference to MODE declarator: 
reference to tokyl, virtual MODE declarer. 

formal reference to STOWED declarator: 
reference to tokyl, formal STOWED declarer. 

formal reference tc NONSTOWED declarator: 
reference to tokyl, virtual NONSTOWED declarer. 

VICTAL KCWS of STOWED declarator: 

VICTAL FCWS rower BRACKET, VICTAL STOWED declarer. 

VICTAL ROWS of NONSTOWED declarator: VICTAL ROWS rower 
FRACKET, virtual NONSTOWED declarer. 

VICTAL rcw KOWS rower: 

VICTAL row rower, comma tokyl, VICTAL ROWS rower. 

VICTAL row rower: VICTAL lower bound, up to tokyl, 
VICTAL upper bound. 

VIRMAL recw rower: EMPTY. 

virtual ICWPER bound: EMPTY. 

actual LOWPER bound: 
strict LCWPEK bound, flexible tokyl option. 

strict ILCWPEF bound: strong integral unit. 

formal LCWPEE bound: 
flexible tckyl option; either tokyl. 

VICTAL EROCELURE declarator: 
procedure tokyl, virtual PROCEDURE plan. 

virtual procedure with PARAMETERS MOID flan: 
virtual FARAMETERS pack, virtual MOID declarer. 

virtual MODE parameter: virtual MODE declarer. 

virtual vcid declarer: void tokyl. 

virtual procedure MOID plan: virtual MCID declarer. 

parameters pack: VICTAL PARAMETERS pack. 

VICTAL union of LMOCDS MOOD mode declarator: 
union of tokyl, LMCODS MOOD and open box pack. 

LOSETY LMOOD open BOX: LOSETY closed LMCCD end POX. 

LOSETY closed LMOODSETY LMOOD end BOX: 

LOSETY closed LMOODSETY LMOOD LMOOD end BOX; 
LOSETY open LMOODSETY LMOOD BOX. 

LOSETY closed LMOODSETY LMOOD end LMOCT BOX: 
LOSETY closed LMOODSETY LMOOT LMOOD end FOX. 

open LMCODS LMOCD BOX: LMOOCDS LMOOD BCX; 
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Open LMOODS kox, comma tokyl, LMOOD ECX. 
Ope alin CCD Er exsweLNOOD box. 
LMOODS MCCD and box: 
union of ILMOODS MOOD mode mode indication; 
union cf tokyl, open LMOODS MOOD and box pack. 
MOOD and Eox: virtual MOOD declarer. 


1. Mode Declarations 


mode declaration: mode tokyl, mode definition list; 
union of tokyl, union mode definition list; 
sructure tckyl, structured mode definition list. 

mode definition: MODE mode indication, equals tckyl, 
actual MCDE declarer. 

unicn mode definition: 
union of LMOCDS MOOD mode mode indication, equals 
tokyl, IMCODS MCOD and open box pack. 

structured mode definition: 
structured with FIELDS mode indication, equals 
tokyl, actual FIELDS declarator pack. 


1. Priority Declaraticns 


priority declaration: 
prior pepiicriity definition list. 

priority definition: priority NUMBER dyadic 
indicaticn, equals tokyl, NUMBER token. 

one token: digit one tckyl. 

TWO token: digit two tokyl. 

THREE tcken: digit three tokyl. 

FOUR tcken: digit four tokyl. 

FIVE tcken: digit five tokyl. 

SLX ftokiemst idag itesmsestokyl. 

SEVEN tcken: digit seven tokyl. 

EIGHT tcken: digit eight tckyl. 

NINE token: digit nine tokyl. 


1. Identity Declarations 


identity declaration: 
formal MCDE declarer, formal MODE definition list; 
heap tokyl. opticn, actual MODE declarer, actual 
MODE definition list; 
procedure tokyl, procedure PARAMETY MOID definition 
lasst . 

formal MODE definition: MODE mode identifier, equals 
tokyl, actual MODE parameter. 

actual MODE definition: reference to MCDE mode 
identifier, MODE initialization option. 

MODE initialization: becomes tckyl, MODE source. 

procedure PARAMETY MOID definition: 
procedure PARAMETY MOID mode identifier, equals 
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tokyl, procedure PARAMETY MOID routine denotation. 
actual MODE parameter: 
strong MODE unit; MCDE trdansformat: 


1. Cperation Declarations 


Operaticn declaration: 
operaticn tokyl, operator definition list; 
operaticn tckyl, virtual PRAM plan, ERAM ADIC 
operator definition list. 

operator definition: 
virtual FRAM plan, PRAM ALDIC operator, equais 
tokyl, actual PRAM parameter; 
FRAM ALIC operator, equals tokyl, PRAM routine 
denotaticn. 

PRAM ADIC operator definition: PRAM ADIC operator, 
equals tokyl, actual PRAM parameter. 


1. Unitary Clauses 


SORTOUNItary MOID clause: SORT MOID tertiary; 
SORT repetitive MOID clause; 
SORT MCID routine denotation; 
SORT MCID confrecntation. 
SORT MOID tertiary: SORT MOID secondary; 
SORT MOID PRIORITIES ADIC fcrmula. 
SORT MOID secondary: SORT MOID primary; 
SORT MCID cohesion. 
SORT MOIP primary: SORT MOID base; 
SORT ECICSEDTNOIDeclauses 


2. Repetitive Clauses 


strong repetitive void clause: 
dovinitializetion part; -do Loops part. 
do initialization part: strong from integral ciause 
option, strong by integral clause option, strong to 
integral clause cption. 
do loop part: control identifier option, strong while 
toolean clause option, strong do void clause. 
strong FROBYI integral clause: 
FROBYT tokyl, strong serial integral clause. 
econtrolrvidentitier: 
for tokyl, integral mode identifier. 
strong while boolean clause: 
while toekyl, strong serial rEoclean clause. 
strong do void clause: 
do tokyl, strong unitary void clause. 
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6.2.0. 180Ccéerecéids 


a) * coercend: SORT COERCEND; SORT1lLy ADAPTEC to COERCEND. 
b) * SORT coercend: SORT COERCEND. 
c)* ALAPTED coercend: SORTly ALAPTED to COERCEND. 
qd) strong CCERCEND: 
COERCEND; strongly ALAPTED to COERCEND. 
0) firm COERCEND: COERCEND; firmly ADJUSTED to COERCEND. 
jfia) meek COERCEND: COERCEND; firmly FITTED to COERCEND. 
fb) weak COERCEND: COERCEND; weakly FITTED to COERCEND. 
g) soft COERCEND: 
COERCEND; softly deprocedured to COEFCEND. 


8.2.1.1. Dereferenced Coercends 


a) STIRMly dereferenced tc MODE FORM: 
reference to MODE FORM; 
STIRMly FITTED to reference to MODE FCRM. 

b) weakly dereferenced to reference to MODE FORM: 
reference to reference to MCDE FORM; 
weakly FITTED tc reference to reference to MODE 
FORM. 


8.2.2.1. Deprocedured Coercends 


jaa) STIRMly deprocedured tc MOID FORESE: 
procedure MOID FORESE; 
STIRMly FIITED to procedure MOID FORESE. 
jab) STIRMly deprocedured to MODE confrontation: 
Frocedure MODE confrontation; 
STIRMly FITTED to procedure MODE confrontation. 
b) weakly deprocedured to MODE FORM: procedure MODE FORM; 
firmly FITTED te procedure MODE FORM. 
c) softly deprocedured to MODE FORM: procedure MCDE FORM; 
softly deprocedured to procedure MODE FORM. 


8.2.3.1. EFrocedured Coercends 


a) STIRMly procedured to frocedure MOID FORM: MOID FORM; 
STIRMly dereferenced to MOID FORM; 
STIRMly procedured to MOID FORM; 
STIRMly united to MOCID FORM; 
STIRMly widened to MOID FORM; 
STIRMly rowed to MOID FORM. 


8.2.4.1. United Coercends 


a) STIRMly united to union of LMOODS MOOD mode FORM: 
cne cut of LMOOLDS MOOD mode FORM; 
scme of LMOODS MOOD and but not FORM. 

b) one out of LMOODSETY MCOD RMOODSETY mode FCRM: 
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HOGDSECR ME 
Tire lyeFLITED. to MOCD FORM: 
firmly procedured to MOCD FORM. 

cS) somegot LMOCDSETYe MOOD and RMOODSETY but not LCSETY 
EORMs 
scme of LMOODSETY and MOOD RMOODSETY but not LOSETY 
ioe Jelte| & 
scme of LMOODSETY RMOODSETY but not MOOD and LOSETY 
FORM. 

d) some of and LMOCD MCOD RMOODSETY but not LMCOT LCSETY 
FORM: 
union of LMOCD MOOD RMOODSETY mode FCRM; 
firmly FITTED to union of LMOOD MOOD KEMOODSETY mode 
FORM. 


8.2.5.1: Widened Coercends 


| a) strongly widened tc SHONGSETY real FORM: 

SHCNGSETY integral FORM; 
strongly FITTED to SHONGSETY integral FCRM. 

b) strongly widened tc structured with REAL field letter 
r letter e€ and REAL field letter i letter m FORM: 
REAL FORMsiStrongayiSLTTED to REAL FORM; 
strongly widened to REAL FORM. 

Cc) strongly widened to row of boolean FORM: BITS FORM; 
StronglyeFPLTAEDetosbREsSPFroR nM: 

d) strongly widened to row of character FORM: BYTES FORM; 
strongly) FILTED{Lo+ Bytes FORM: 


8.2.6.1. Rowed Coercends 


Jaa) strongly rowed to REFETY row ROWS of MODE FCRM: 
REFETY ROWS of MODE FORM; 
strongly ADJUSTED tc REFETY ROWS of MODE FCRM; 
strongly widened to REFETY ROWS of MODE FORM; 
strongly rowed to REFETY ROWS of MODE FORM. 
Jab) strongly rowed to REFETY row of MODE FOEM: 
REFETY MODE FORM; 
strongly ADJUSTED to REFETY MODE FORM; 
strongly widened to REFETY MODE FORM; 
strongly rowed to REFETY MODE FORM. 


8.2.701. Hipped:C€oercends 


{| a) strongly hipped to MOID base: 

MCID skip; MOID jump; MOID nihil; MOID vacuun. 
b) MOILC skip: skip tokyl. 
c) MOID jump: go to tokyl option; label identifier. 
d) reference to MODE nihil: nil tokyl. 
e) KOWS of MODE vacuum: open tokyl, close tokyl. 
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852.8749  VYoidedicoercthds 


a) strongly voided to void confrontation: 
MCDE confrontation. 

b) strongly voided to void FORESE: NONPROC FORESE; 
strongly deprocedured to NONPRCC FORESE. 


Sess. sCOntLOntations 


a) MOID ccnfrontation: MCID assignation; 
MOID identity relation; MOID cast. 


O23, ACH SSASSighations 


a) reference to MODE assignaticn: reference to MCDE 
destinaticn, becomes tokyl, MODE source. 

b) reference to MODE destination: 
scft reference to MODE tertiary. 

c) MODE source: strong MODE unit. 


S.3.3. ls@identity*kelatione 


a) boclean identity relation: 
soft reference to MODE tertiary, identity relator, 
strcng reference to MODE tertiary; 
etrong reference to MODE tertiary, identity 
relator, scft reference to MODE tertiary. 

Dy identity relator: is tokyl: 1s not tokyl. 


a clits lig TGEVReS 


a) MOID cast: virtual MOID declarer, cast of tokyl, 
strong MCID unit. 


8.4.1. Formulas 


a)* SORT formula: SORT MOID PRIORITIES ADIC formula. 

b) MOID PRICKITY dyadic formula: 
LMCDE FERICRITY operand, procedure with LMODE 
parameter and KMODE parameter MOID PRIORITY dyadic 
operator, KMCDE PRICRITY pius one operand. 

c) * operand: MCDE PRIORITIES operand. 

d) MODE PRICRITY operand: firm MODE PRIORITY dyadic 
formula; MODE PRIORITY plus one operand. 

e) MOLE priority NINE plus one operand: 
firm MCDE priority NINE plus one monadic formula; 
firm MCDE secondary. 

g) MOID priority NINE plus one monadic formula: 
procedure with RMODE parameter MOID priority NINE 
plus cne monadic operator, RMODE priority NINE plus 


one operand. 
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h)* dyadic formula: MOID PRIORITY dyadic formula. 


i)* monadic formula: 


MCID priority NINE plus one monadic formula. 


Essel. CoOhesions 


a) MOLE cohesicn: MODE generator; MODE selection. 


Beo. |. Ve. Generators 


a) MOLE dqenerator: 


MODE local generator; MODE global generator. 


b) reference to MODE local generator: 
local tokyl, actual MODE declarer. 

c) reference to MODE global generator: 
heap tokyl, actual MODE declarer. 


Beoees te. SELECTIONS 

a) REFETY MODE selection: MODE field TAG 
tokyl, weak REFETY structured with 
field TAG RFIELDSETY secondary. 

8.6.0.1. Bases 


a) MOID base: MCID mode identifier; MOID 
MOID slice; MOID : call. 


Bebe dele OLLCeSs 


{aa) REFETY ROWS of MODE slice; 


selector, of 
LFIELDSETY MODE 


denotation; 


weak REFETY ROWSETY ROWS of MODE primary, RECWSETY 


ROWS leaving ROWS indexer BRACKET. 


ROWS leaving EMPTY indexer BRACKET. 
b) row ROWS leaving row ROWSETY indexer: 


jab) REFETY MODE slice: weak RKEFETY ROWS of MODE primary, 


trimmer option, comma tokyl, ROWS leaving KOWSETY 


indexer; 


subscript, comma tckyl, ROWS leaving row KOWSETY 


indexer. 
c) row ROWS leaving EMPTY indexer: 


subscript, comma tokyl, ROWS leaving EMPTY indexer. 
d) row leaving row indexer: trimmer option. 


e) row leaving EMPTY indexer: subscript. 


f) trimmer: strict lower bound option, up to tokyl, stict 
upper bound option, new lower bound part option; 


new lower bound part. 


q) new lower bound part: at tokyl, new lower found. 


h) new lower bound: strong integral unit. 
i) sukscript: strong integral unit. 

j)* trimscript: trimmer; subscript. 

k) * indexer: RCWS leaving ROWSETY indexer. 
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1)* boundscript: strict LOWPER bound; new lower bound; 
subscript. 


bigeaalia ilo welliks 


a) MOIL call: firm procedure with PARAMETEFS MOID 
primary, actual PARAMETERS pack. 
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APPENDIX B 


A CONTEXT-EBREE, SYNTAX 


The context-free syntax given in this Appendix 
descriktes a superset of the modified syntax of ALGOL 68 
given in Appendix A. It is a superset in the sense that the 
transformation from the two-level syntax described in the 
Report to context-free syntax greatly reduces the amount of 
informaticn carried by the syntax. For e€xampie, neither 
modes nor coercions are expressed in the context-free 
syntax. Intrinsic ambiguities arising in this syntax are 


discussed in Chapter IV of this thesis. 


Note that productions with "empty" occurring on the 


right-hand side have been permitted in the following syntax. 


The syntactic entities are given, as far as possible, 
mnemcnic representations recalling the notions of the syntax 
given in Appendix A. Prefixes, infixes, and suffixes used in 
the ccmposition of the syntactic entities are listed below 
tegether with their meanings in terms of the syntax given in 
Appendix A. This table should be regarded only as an aid for 


reading the syntax. 
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Mien exsees actnal 
align....alignment 
assig....assignation 
begtss4e4 begin 
bool.....btoolean 
char.....echaracter 
Chol.....choice 

Git .ss.ss Clause 

ROO. ws 6 ss closed 
aise awe acOULbabtera d 
epilect..collection 
Gond.....conditional 
[Witels «se CONLOTMItyecase 


Comper nt. CONnLLOCnLation 


Oss eis 
dec... 
decer. 
deneee 
G1. as 
GOL. «s 
GY sees 
CLVese 
CX Dis vee 
ned Me ENS 


EOS ee 


e 
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~-CONFASE 
-declaration 
.declarer 
-denotation 
sdigit 
-declarator 
-dyadic 
eempty 
-€xponent 
floating 


HLOrMat 


MO. saves 


MONeeee 
NOS sss 
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GPasess 
PUtsses 
PaALecee 
Pat.ce. 
PKe coos 
prel... 
DUIS wate 
PrOC eer 
Pte weer 
Lefeeee 
TEleaes 


replic. 


repliit.. 


EDN 5 oss 
Sececcee 
SCLecee 
ShO.cece 
SPeecece 
Spec... 
Stacece 


stag... 


-- mould 

«-mMonadic 
«-NONSTOWED 

». long 

».-Operation 

ss PELOTUTY 

ee parameter 
«e-pattern 

es pack 

»-prelude 

SPL VOrieuy: 
-.procedtre 

eee On 

.. Lreference to 
«relation 
..replication 
--replicated literal 
erLouts ne 
»-sequence, list 
«.serial 

oe SHOrt 
«.sequence proper 
..specification 
- Statement 


«-Stagnant 


107 


RE th aad sqhes dl 
Bipow........0m ] 
DEBBAOD «++ 400 0itOM ; 
@EWOPZUON. ».. oe sO 
GAOda» ses <o Qh 
NOLFHIGGO. . 6-06-49 G 
UPEAOLIGs oo eseeed 

1OPSOB18G.06e50ethg |” 
AIPPSIBYe ees s+ FHZ 
AOBQ sve eee AAG 
phy lerq......fong 
yi iso bag....+--fIg | * 
8 12205q — 
PAEOG sos 9 250 ¢24uz0%a00 


poe te 
oF Ssoasteterz ene tees toa soitesnoa 09. 





MOLIBLOT oo we «oe LOT 
aoisevilqer....0iIged 
{seietil bBeteokiqes..sstiiges 
arlTo9%. ss oey ATR" 

tail —— baaiue 
[6£I19G conn 8398 
STOMS ss vv ee sds 

18402 — —— 
nok seoktioege-+.++-D9qa 
Jnoms1632..+n +68 


‘4 
8 GQ a, Cape 


Supnpese...<++DRta ar 


reas”: 


108 


form.....formula stan......Sstandard 

Mists ae SAN enbT Cy StOwlse« oes SLOWED 

iden... ocsidentifier BU shies SL LUCTULed With 
td cis. dbndreation Sie... «ss SU1Le 
init....einitialization suppress..suppressible 
insert...insertion SYeececeee Symbol 

U0 Gane «se LNLegGer UNLe sss 05s UNLtAry 

Pathe ss. sabanel: Wnd-oO¢ o2.. sunLomIes 

let... .. colabiien Wiw. val 

DO Ue as = 3 -lower bound V0 sedan 6 ee VOILA 


‘Terminal isproductions of this Syntax are “letter”, 
Hcharactentsetindicant", “dyindicant", “monindicant", and 
all entities ending with "*sy". For their representations, 


see Appendix C. 


program: clccl. 
labsety: empty; 
laksety, lab. 
empty: . 
lab: jidéenjelat*sy: 
iden: letter; 
iden, letter; 
idenynyadigaté 
apgivdidigdtsy ; 
dignoto. 
dignot0O: digi*sy; 
dig2*sy; 
dig3*sy; 
dig4*sy; 
dig5*sy; 
dig6*sy; 
dig7/*sy; 
dige*sy; 
dig9*sy. 


chause: clech; 
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éeoloci: 
Gosicl? 


cbocl: begin*sy, sercl, end*sy; 
open*sy, sercl, close*sy. 


colcl: begin*sy, kalance, end*sy; 
open*sy, btalance, close*sy; 
parallel*sy, begin*sy, balance, end*sy; 
parallel*sy, open*sy, balance, close*sy. 
balance: unit, ccmma*sy, unit; 
kalance, comma*sy, unit. 


coscl: ccndbeg*sy, sercl, condchoicl, condend*sy; 
casekeg*sy, sercl, casechoicl, casend*sy; 
confbeg*sy, sercl, confchoicl, confend*sy; 
cosheg*sy, sercl, choicl, cosend*sy. 
condchoicl: stropthef*sy, sercl, condchoicl; 
condincl, condoutclety. 
casechoicl: caseincl, caseoutclety. 
cont Ccho1cl: centincl, contoutclery. 
GhoiGl] e®ehovelt; 
enoirel 2: 
Ehoicis: 
enoicl TeRbrieithet*sy, sercl, choici; 
inebtpceutclety; 
inelipeoutclt. 
ehovel2svinelzZfecutclety; 
incl2FecuticlZ. 
GEOVeLIs Incispfioutclety; 
tee? 
condincl: condin*sy, sercl. 
caseincl: casein*sy, balance. 
confincly®éenfin* cy 7 *confunie; 
confin*sy, confbalance. 
LAER etin*Ssy7 tsercl. 
incl2: in*sy/;*balance. 
Ginewss Lo*syPptconfunit; 


in*sy, 
condoutclety: 


caseoutclety: 


confoutclety: 


confkalance. 

empty; 

condout*sy, sercl; 
condoutkeg*sy, sercl, 
empty; 

caseout*sy, sercl; 
caseoutbeg*sy, sercl, 
empty; 

confout*sy, sercl; 
confoutbeg*sy, sercl, 


outclety: empty; 

out*sy, sercl. 
outcl1: outkeg*sy, sercl, choicl’. 
outcl2: outbeg*sy, sercl, chcicl2. 


condchoicl. 


casechoicl. 


SouLongo Lc... 
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Butcla:scutkheg*sy, sercl, choicl3. 
confbalance: confunit, comma*sy, confunit; 
confbalance, ccmma*sy, confunit. 
Sones cee spec Unit. 
spec: open*sy, fodecer, iden, close*sy, choice*sy; 
open*sy, fodecer, close*sy, chcice*sy. 


sercl: suitrains; 
decprels, svitrains. 
decprels: decprel; 
decpgrels, decprel. 
decprel: decs, goon*sy; 
Staprels, decs, goon*sy. 
staprels:@unaty,, goon*sy; 
Staprels, unit, goon*sy. 
Suitrains strain: 
suitrains, completer, train. 
train: laksety, unit; 
train, goon*sy, labsety, unit. 
completer: completion*sy, lab. 


UnLEs tertiary: 
ctnden; 
repetitivecl; 
confront. 

tertiary: seccndary; 

formula. 
seccndary: primary; 
cohesicn. 
primary: clause; 
kase. 


Btnden:-ftoparpk, ,vovidecer, rtn*sy, unit; 
yovidecer,,rtn*sy, unit. 
Eoparpk: topen*sy, -fopars, clcse*sy. 
fopars: Lopars 
fopars, comma*sy, fofar. 
fopar: fodecer, iden; 
fopar, comma*sy, iden. 
vovidecer: videcer; 
void*sy. 


repetitivecl;: fromclety, byclety, toclety, foridenety, 
whileclety, docl. 
fromclety: empty; 
from*sy, sercl. 
byclety: empty; 
ky*sy, sercl. 
ftoclety..denptys 
to*sy, sercl. 
foridenety: empty; 
for*sy, iden. 
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whileclety: empty; 
while*sy, sercl. 
Dock: *do*sy ,;* Unt ct 


EouerTent. sacs. 


assig: 
idrel: 


formula: 


dyform: 


piform: 
PZ LCL m 
p3form: 
p4form: 
Dos .OLm 
poform: 
prror: 
p8ftorm: 
pPILornr: 


tertiary, 
LeCttiar ys 
TETtiaLry, 
cast: vovidecer, 


1drei* 
Cast. 


mcenform; 
dyform. 
plform; 
p2form; 
ESLOTUE 
PF4form; 
p5form; 
péforn; 
PYLOorm: 
Ppeforn; 
p9form. 
Floperand, 
p2operand, 
p3operand, 
p4operand, 
pooperand, 
pooperand, 
Evoperand, 
pSoperand, 
FIoperand, 


kecomes*sy, 
is*sy, 


ploperatcr, 
p2operator, 
p3operator, 
p4Yoperator, 
pSoperatcr, 
p6operator, 
ploperator, 
p8operator, 
p90peratcr, 


unit. 


ploperand: 
p2operand: 
p3operand: 
p4operand: 
pooperand: 
p6operand: 
p7operand: 
p8operand: 


p9operand: 


plQoperand: 


plform; 
p2cperand. 
p2forn; 
p3operand. 
p3fcrm; 
F4operand. 
p4form; 
pSoperand. 
pSform; 
réoperand. 
péform; 
p/Voperand. 
p7form; 
p8operand. 
p8forn; 
pSoperand. 
p9form; 


p10operand. 


monform; 


secondary. 


unit. 
tertiary; 
isnot*sy, tertiary. 
cast*sy, 


p2operand. 
p3operand. 
p4operand. 
p5operand. 
p6operand. 
p7operand. 
p8operand. 
p9operand. 
pl0operand. 
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mcnform: monind, pi0operand. 


ploperator: 
p2operator: 
p30perator: 
p4Yoperator: 
Properator: 
pooperator: 
p/operator: 
p8o0perator: 
p9operator: 


dyind. 
dyind. 
dyind. 
dyind. 
dyind. 
dyind. 
dyind. 
dyind. 
dyind. 


monind: shongsety, monindicant. 
dyind: shongsety, dyindicant. 


cohesion: generator; 
selecticn. 


generator: local*sy, acdecer; 
heap*sy, acdecer. 
of*sy, secondary. 


selection: iden, 


Ease: iden; 
den; 
jugE; 
skip*sy; 
nil*sy; 
slice: 
ead; 
vacuum. 


dens: shongsety, 
shongsety, 
ktoolden; 
charden; 
shongsety, 
stringden; 
formatden. 


intden; 
realden; 


btitsden; 


shongsety: empty; 
longs; 
shorts. 


longs: long*sy; 


longs, long*sy. 
Shorts: short*sy; 


shorts, 


intden: digits 
intden, 


realden: intden, 
— 
intden, 
expety: empty; 
exp. 


exp: letE*sy, flusminusety, 


shcert*sy. 


digit. 


pt*sy, intden, 
intden, expety; 
EXP. 


expety; 


intden. 
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plusminusety: empty; 
plus*sy; 
Minus*sy. 


boolden: true*sy; 
false*sy. 


Charden: kriefqucte*sy, stringitem, briefquote*sy; 
Stropgquote*sy, stringitem, stropquote*sy. 
stringitem: briefquote*sy, briefquote*sy; 
stropquote*sy, stropquote*sy; 
character. 


bitsden: dig2*sy, letk*sy, radix2bytes; 
dig4*sy, letk*sy, radix4bytes; 
dig8*sy, letk*sy, radix8bytes; 
digi*sy, dig6*sy, letR*sy, radixi6bytes. 
radix2bytes: radix2byte; 
Ladix2bytes, radix2byte. 
radix4bytes: radix4byte; 
radix4bytes, radix4byte. 
radix8bytes: radix8byte; 
radix8bytes, radix8byte. 
radixl6bytes: radixl6byte; 
radixl6bytes, radixtl6byte. 
radix2byte: digQ*sy; 
digi*sy. 
radix4byte: radix2byte; 
dig2*sy; 
dig3*sy. 
radix&byte: radix4byte; 
dig4*sy; 
dig5*sy; 
dig6*sy; 
dig?*sy « 
radixl6byte: radix8byte; 
dugdts y; 
dig9*sy; 
letA*sy; 
letB*¥sy; 
letcC*sy; 
letD*sy; 
letE*sy; 
letF*sy. 


stringden: briefquote*sy, briefquote*sy; 
stropquote*sy, stropquote*sy; 
triefquote*sy, stringitemsp, briefquote*sy; 
stropquote*sy, stringitemsp, stropquote*sy. 
stringitemsp: stringitem, stringiten; 
stringitemsp, stringitem. 


ere 
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formatden: formatter*sy, collects, formatter*sy. 
€ciiects :scoliect*s 
collects, comma*sy, collect. 
Collect: i pictures 
insertety, replicety, open*sy, collects, close*sy, 
insertety. 
picture: patternety, insertety. 
patternety:”* empty; 
intpat; 
realpat; 
boclpat; 
charpats 
stringpat; 
ccmplexpat; 
DAtSpar. 
insertety: literalety, insertsety. 
insertsety: empty; 
insertsety, insert. 
insert: replicety, align, literalety. 
replicety: empty; 
Treplic. 
Geplicspintdens 
letN*sy, clause. 
align: letK*sy; 
Leti*sy; 
letY*sy; 
letL*sy3; 
letP*sy. 
literalety: empty; 
literal. 
literal: replicety, stringcharden, replitsety. 
replitsety: empty; 
replitsety, replic, stringcharden. 
stringcharden: charden; 
stringden. 
Signmoety: empty; 
insertety, signframe; 
insertety, replicety, zercoframe, signframe. 
zeroframe: letZ*sy. 
Signframe: pflus*sy; 
minus*sy. 


intpat: signmoety, intmo; 
insertety, letC*sy, cpen*sy, literals, close*sy. 
intmc: insertety, replicety, suppressety, digframe; 
intmo, insertety, replicety, suppressety, digframe. 
digframe: zercframe; 
letr*sy. 
suppressety: empty; 
letS*sy. 
hibecalst:literal; 
literals, ccmma*sy, literal. 
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realpat: signmoety, realw#o; 
floptmo. 
realmo: intmo, insertety, suppressety, ptframe, intmoety; 
; insertety, suppressety, ptframe, intmc. 
PucMmoetly: empty; 
intro. 
PELPaMe: “pr*sy. 
floptmo: stagmo, insertety, suppressety, expframe, 
signmoety, intmo. 
stagmo: signmoety, intmo; 
signmoety, realmc. 
expframe: letk*sy. 


toolpat: insertety, letB*sy, boolmoety. 
Peormeety: empty; 
open*sy, literal, comma*sy, literal, close*sy. 


charpat: insertety, suppressety, charframe. 
charframe: letA*sy. 


coliplexpat: realpat, insertety, suppressety, complexframe, 
realpat. 
complexframe: letiI*sy. 


stringpat: insertety, stringframe; 
charframesp; 
insertety, replic, suppressety, charframe. 
stringframe: letT*sy. 
charframesp: insertety, replicety, suppressety, charframe, 
insertety, replicety, suppressety, charframe; 
charframesp, insertety, replicety, suppressety, 
charframe. 


bitspat: radixmo, intmo. 
radixmo: insertety, radix, letR*sy. 
Padic Pol getsy ; 

dig4*sy; 

dig8*sy; 

digi*sy, dig6*sy. 


jump: goto*syy ‘iden’ 


slice: primary, suk1*sy, indexers, busi*sy; 
primary, suk2*sy, indexers, busz*sy; 
primary, open*sy, indexers, close*sy. 
indexers: trimscriptety; 
indexers, comma*sy, trimscriptety. 
trimscriptety: subscript; 
trimmerety. 
Swhecra pte suns. 
trimmerety: boundety, upto*sy, boundety, newlobety; 
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newlokety. 
boundety: empty; 
unit. 
newlobkety: empty; 
attsy, Units 


call: primary, ofen*sy, acpars, close*sy. 
acpars;: .acpar; 

acpars, ccomma*sy, acpar. 
aGpari,Unit, 


vacuum: open*sy, close*sy. 


decs: unidec; 
decs, ccmma*sy, unidec. 


unidec: modedec; 
strdec; 
uniodec; 
pridec; 
iddec; 
opdec. 


modedec: mode*sy, modeind, equal*sy, acdecer; 

modedec, comma*sy, modeind, equals*sy, accecer. 
strdec: str*sy, modeind, equals*sy, acfieldpk; 

strdec, comma*sy, modeind, equals*sy, acfieldpk. 
acfieldpk: ofpen*sy,, acfieldors, .close*sy. 
uniodec: union*sy, modeind, equals*sy, videcerspk; 
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uniodec, comma*sy, modeind, equals*sy, videcerspk. 


videcerspk: open*sy, videcersp, close*sy. 
videcersp: videcer, ccmma*sy, videcer; 
videcersp, comma*sy, videcer. 


pridec: pri*sy, dyind, equals*sy, dignot0; 
pridec, comma*sy, dyind, equals*sy, dignoto. 


iddec: iddecis; 
iddec2s; 
iddec3s. 
iddecis: fodecer, idacpar; 
iddecis, comma*sy, idacpar. 
idacpar: iden, equals*sy, acpar. 
iddec2s: heapety, acdecer, ideninit; 
iddec2s, comma*sy, ideninit. 
heapety: empty; 
heap*sy. 
ideninit: iden; 
iden, becomes*sy, unit. 
iddec3s: proc*sy, idrtnden; 
idecdec3s, comma*sy, idrtnden. 
idrtnden: iden, equals*sy, rtnden. 
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opdec: opdecis; 
opdec2s. 
opdects?top*sy;, Opdcpar 1; 
opdecis, comma*sy, cpacpar’. 


Opacpar1: viplan, adicind, equals*sy, unit; 
adicind, equals*sy, rtnden. 
opdec2s: op*sy, viplan, opacpar2; 
opdec2s, comma*sy, cpacpar2. 
Oopacpar2: adicind, equals*sy, unit. 
viplan: open*sy, videcer, close*sy, vovidecer; 
ofpen*sy, videcer, comma*sy, videcer, close*sy, 
vovidecer. 
adicind: monind; 
dyind. 
acdecer: acnosdor; 
acstcdor; 
modeind. 
acncsdor: procdor; 
aniodor ; 
acrefdor. 
acrefdor: ref*sy, videcer. 
acstodor: Yacstrdor; 
acrowdor. 
acstrdor: str*sy, acfieldpk. 
acfieldors: acfieldor; 
acfieldors, comma*sy, acfieldor. 
acfieldor: acstodor, iden; 
vincesdor, iden; 
modeind, iden; 
acfieldor, comma*sy, iden. 
acrowdor: sub1*sy, acrowers, bus1*sy, acstodor; 


suk2*sy, bus2*sy, 
open*sy, 
subi*sy, 
suk2*sy, 
open*sy, 
sukbki*sy, 
subz*sy, 
open*sy, 
aCLOWEL; 
aCLOWELIS, 
ackound, upto*sy, 
unit; 
unat, 


acroweLrs, 
acrowers, 
acrowers, 
acrowers, 
acLOWEeIs, 
acrowers, 
acCrowers, 
acLtrowers, 


bus1*sy, 
bus2*sy, 


bus1*sy, 
bus2*sy, 


acrowers: 
ccomma*Sy, aCrower. 
acrower: acbound. 
acbound: 


flex*sy. 


fonosdor; 

fostodor; 

modeind. 
procdor; 
uniodor; 


fodecer: 


LoOnesaqoL: 


close*sy, 


close*sy, 


close*sy, 


acstodor; 
acstcdor; 


vinosdor; 


vinosdor; 
vinosdcr; 


modeind; 
modeind; 


modeind. 
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forefdor. 
forefdor: ref*sy, fostodor; 
ref*sy, vinosdor. 
festodor ieLostrdor ; 
forcwdor. 
fostrdor: str*sy, open*sy, fofieldors, close*sy. 
fotieldorssfofieldon: 
fofieldors, conma*sy, fofieldor. 
fofieldor:-fostodor, iden; 
vincesdor, iden; 
modeind, iden; 
fofieldor, comma*sy, iden. 
forowdor: suki*sy, forowersety, bus1i*sy, fostodor; 
sub2*sy, forowersety, bus2*sy, fostodor; 
open*sy, forowersety, close*sy, fostodor; 
subi*sy, forowersety, bus1*sy, vincsdor; 
subz*sy, forowersety, bus2*sy, vinosdor; 
Orfen*sy, forowersety, close*sy, vinosdor; 
subi*sy, forowersety, busi*sy, modeind; 
sukz*sy, forowersety, bus2*sy, modeind; 
open*sy, forowersety, close*sy, modeind. 
fcrowersety: forcwerety; 
forowersety, comma*sy, forowerety. 
forowerety: empty; 
fokoundety, upto*sy, foboundety. 
foboundety: empty; 
flex*sy; 
either*sy. 


videcer: vinosdor; 

vistcodor; 

modeind. 
Vinosdor: procdor; 

uniodor; 

virefdor. 
virefdor: ref*sy, videcer. 
wEstouob: wVistrdors 

vircwdor. 
vistrdor: str*sy, open*sy, vifieldors, close*sy. 
vifieldors: vifieldor; 

vifieldors, comma*sy, vifieldor. 
vifieldor: videcer, iden; 
vifieldor, comma*sy, iden. 

vircwdor: subi*sy, virowersety, busl*sy, videcer; 

sub2*sy, virowersety, bus2*sy, videcer ; 

open*sy, virowersety, Close*sy, videcer. 
virowersety: virowerety; 

virowersety, comma*sy, virowerety. 
virowerety: empty; 
upto*sy. 


procdcr: proc*sy, vovidecer; 
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uniodor: 
modeind: 


modestan: 


proc*sy, open*sy, videcer, close*sy, 
proc*sy, videcerspk, vovidecer. 
unio*sy, videcerspk. 


modestan; 

indicant. 
shongsety, 
shongsety, 
bool*sy; 
char*sy; 
format*sy; 
shongsety, 
shongsety, 
string*sy; 
sema*sy; 
shongsety, 
file*sy. 


int*sy ; 
real*sy; 


bits*sy; 
bytes*sy; 


complex*sy; 


vovidecer; 
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APPENDIX C 


A REPRESENTATION LANGUAGE 


This representation language for ALGOL 68 was designed 
for the cne-pass implementation proposed in this thesis. The 
symbols for which representations are given correspond to 
terminals of the syntax presented in Appendix A. Where such 
a symkol is mentioned, the corresponding terminal from the 
context-free syntax given in Appendix B, if one exists, 


follews in parenthses. 


The "stropping" convention (method of representing 
bold-faced or underlined symbols) chosen is that of a "bold- 
face shift", that is, an apostrophe followed by a 
sequence cf letters (upper cr lower case) or digits, ending 
cn a “light-face shift", that is, any mark different from 
the representation of a letter or digit. A symbol formed in 
this fashion is called a “strop-word". Note that ‘IF is a 


different strcop-word than ‘if. 


The representations are grouped in an order similar to 


that used in Chapter 3 of the Report. 
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1. Letter Tokens 


The fproducticn for the metanotion "ALPHA" is altered 

[1.1.4] as follows; 

ALORA ashb: “svagecsyllowsrt case 
lower Case Z. 

Then the follcwing are representations for letter symbols; 
letter a symkcl (letA*sy) 
letter lcwer case a symbol 


letter b symkol (letB*sy) 
letter lcwer case b symbol 


owl Ye 


letter z symkol (letZ*sy) 
letter lower case z symbol 


NWN 


The terawinal preduction “letter" of the context-free syntax 


includes all letter-tokens. 


2. Denotation Tckens 


Digit Tckens. 


digit zero symbol (dig0*sy) 0 
digit one symbol (dig1*sy) 1 
digit nine symbol (dig9*sy) 9 
Following are other denotaticn-tokens. 
Eeint symkol (pt*sy) : 
plus symkol (plus*sy) + 
minuse symbol (minus*sy) * 
true symbol (true*sy) 'TRUE 
false symbol (false*sy) IFALSE 
formatter symbol (formatter*sy) $ 
No representation has been provided for the "times-ten-to- 


the-pcower-symtol" (this was not included in the context-free 


syntax). 
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Ss. Charactestioketis 


These «tckens* are used in string- and: character- 
denotations and in ccmments. Representations for all of them 
except the "space-symbol" are given elsewhere in this 
Appendix... Note;«:in«s Appendix A, that "plus-i-times-symbol" 
has been omitted from, and that plus-symbol and minus-symbol 
have feen added to, the list of character-tokens as given in 
the Report. The space-symbol is represented by a blank, 
which grewilluckesysignificantsas it will appearcin fstring=for 
Character- denotations or comments. Insignificant blanks 
within the source text may appear anywhere that comments 


may; that is, between but not within symbols. 


The terminal "character" of the context-free syntax 
ccrresponds to the above character-tckens plus all terminal 
productions cf "cther-string-item" and “other-comment-iten". 
Productions are added [R1.1.5.c] for other-comment-item and 
cther-string-item so that their terminal productions include 
most, if not all, available graphic symbols of the physical 
machine, apart from those already included as character- 
tokens. Note that other-comment-item does not include 
"comment-symkcol" and other-string-item does not include 
quote-symkbol among their respective terminal productions 


(R1. 1a2de]. 
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Confrcntation Tokens 


kecomes symbol (kecomes*sy) 
is symbol (is*sy) 

is nct symbol (isnot*sy) 
cast of symkol (cast*sy) 


Declaration Tokens 


long symrol (long*sy) 
short symbol (short*sy) 
structure symbol (str*sy) 
reference to symbol (ref*sy) 
flexible symbol (flex*sy) 
€ither symbol (either*sy) 
procedure symbol (proc*sy) 
union of symrol (uniox*sy) 
mode symtol (mode*sy) 
Priority symbol (pri*sy) 
local symbol (local*sy) 
heap symbcl (heap*sy) 
operation symbol (op*sy) 
void symbol (void*sy) 


Syntactic Tokens 


cpen symkcl (open*sy) 

close symbol (close*sy) 

begin symbol (begin*sy) 

end symktol (end*sy) 

comma symbol (comma*sy) 

parallel symbol (parallel*sy) 

sub symbol (sub1*sy) 

alternate suk symbol (sub2*sy) 

tus symbol (kus1*sy) 

alternate bus symbol (bus2*sy) 

upto symkol (upto*sy) 

at symbol (at*sy) 

of symbol (of*sy) 

label symbol (lab*sy) 

routine symbol (rtn*sy) 

equals symbol (equals*sy) 

brief CCNFASE begin symbol 
(cosbeg*sy) 

strop conditional begin symbol 
(condbeg*sy) 

strop case begin symbcl 
(casebreg*sy) 
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strep conformity case begin symbol 


(confbeg*sy) 
brief thef symbol (briefthef*sy) 
strop thef symbol (stropthef*sy) 
brief CCNFASE in symbol (in*sy) 
strop conditional in symbol 

(condin*sy) 
strop case in symbol (casein*sy) 
strep conformity case in symbol 

(confin*sy) 
Erief CCNFASE out symbol (out*sy) 
strop conditional out symbol 

(condout*sy) 


strop case out symbol (caseout*sy) 


strep conformity case out symbol 
(confout*sy) 


bkrief CCNFASE end symbol (cosend*sy) 


strop conditional end symbol 
(condend*sy) 
strop case end symbol (casend*sy) 
strcp conformity case end symbol 
(confend*sy) 
brief CCNFASE out begin symbol 
(outbeg*sy) 


strop conditional out begin symbol 


(ccndoutbeg*sy) 
strop case out begin symbol 
(caseoutbeg*sy) 


gal eit 


CUS 


strep conformity case out begin symbol 


(confoutbeg*sy) 
choice symbol (choice*sy) 
from symrol (from*sy) 
ky symbol (by*sy) 
to symbol (to*sy) 
for symbol (for*sy) 
while symbol (while*sy) 
do symbol (do*sy) 


quencing Tokens 


ao on symkol (goon*sy) 
completion symbol (ccmpletion*sy) 
qo tc symbol (goto*sy) 


Hip Tckens 


skip symrkcl (skip*sy) 
nil symbol (nil*sy) 
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9. Special Tokens 


brief quote symbol (briefquote*sy) A 


strop quote symbol (stropquote*sy) "QUCTE 
brief comment symbol tee ame ae 
strop comment symbol TCOMMENT CU 


"PRAGMAT 'PR 
When the quote-symbol ‘QUOTE commences a Str 2n0— Or 
character-denotation, then the blank following it, if there 
is one, “is not’ part of “the string- or character-denotation. 
For ‘example, ‘QUOTE MUGUOTEH Cisttthet sande -ds 4A AMWhile 
‘QUOTE; "QUOTE is the same ag "3", This is a consequence of 


the chosen stropping convention. 


10. Mode Standards 


integral symbol (int*sy) TINT) SINTEGER 
real symbol (real*sy) "REAL 

koolean symbol (bool*sy) 'BOCL 'BOOLEAN 
character symbol (char*sy) ‘CHAR ‘CHARACTER 
format symtol (format*sy) "FORMAT 

bits symkcl (bits*sy) SBGLS 

Lytes symbol (bytes*sy) UBY TES 

string symbol (string*sy) 'STRING 

sema symktol (sema*sy) TSEMA 

complex symkcl (complex*sy) 'CORPL® *GGMPLEX 
file symtol (file*sy) CPLLe 


The alternatives given are defined by inserting declarations 
such as 
'MODE "INTEGER = ‘INT 


into the standard-prelude. 


V1. Other Terminals 


Further terminals in the syntax are indicant, dyadic- 


indicant, and monadic-indicant. In the context-free syntax 
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these are! “indicant", "“dyindicant", and "“monindicant" 
respectively. Productions for these may be added [R1.1.5.c], 
and this is done as follcws; 
indicant: strep word, 
dyadic indicant: strop word; 
special character dyadic indicant. 
monadic indicant: strop word; 
special character monadic indicant. 
Special characters are those characters other than letters 


and digits. 


No strop-word may be a terminal production of both 
indicant and monadic-indicant [R1.1.5.b]. No strop-word that 
isagivencinesectionsartwosethzoughd ninéigofigthis tappendix 
(these may ke called “program-words") may be a terminal 
Froducticn of indicant, monadic-indicant, Or dyadic- 
indicant. No representation ccmprising special characters 
given in sections two through nine of this Appendix (these 
may be called "“special-symbols") may ke froduced by 
indicant, dyadic-indicant, or monadic-indicant, with the 
exception of #y ~f# and@=. Whether these marks are special- 
symbols or not is easily distinguishable by context. These 
conditions and the chosen stropping convention ensure that 
"no sequence of representations of symbols can ke confused 
with any other such sequence" [R3.1.2.c]. An example of an 
ambiguity that results without the above conditions is as 


follows: 
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Note that since indicants, dyadic-indicants, and 
monadic-indicants are themselves symbols, no blanks or 


ccmments may ke embedded within them. 


The entity "special-character-dyadic-indicant" is 


denoted by "D" and given as follows: 


D: D , Wx"; 
By pez 
Dhol: o: 
Dey oe 
tT; 

WeM s 
uss 


Special characters have been quoted to distinguish them from 
syntactic marks. "I" has as its direct productions each 
member of a .set ‘of special characters not including "*", 
Leis UOT. La Us LUE LL W (i 48) UW Ws MU Wg MC UG we MU —— 
ULtee Ge ayes, opt, MIN, 8",  cOLreee". ~Theventity 


"special-character-monadic-indicant" is denoted by "M" and 


given as follows: 


Me io, WEN: 
M ’ —— 
M A LL : 
Mhoit=" : 
ciate 
UENeei] cedefinsdeas above. Now it "T" producest"+", NaN, Mam, 


Wn naw nga, wz" and "%" then the above rules allow 
representations as given below for standard “operator- 


tokens", whose corresponding declarations will ke included 
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in the standard-prelude. For mnemonic reasons, the use of 
"AB" as a suffix for "“and-becomes" is consistent. Where 
alternatives are given, these are introduced by declaration 
in the standard-prelude sc there will be only one 
representaticn per indicant, though scme indicants may 


initially have the same meaning. 


tinus and keccmes symbol EMINUSAB) —<7= 
plus and tecomes symbol ‘PLUSAB +:= 
times and beccmes symbol "TIMESAB *:= 
divided ty and becomes symbol 'DIVAB(/:= 
over and keccmes symbol ‘OVERAB h:= 
modulo and beccmes symbol (MOTAB. 43 2= 
prus and kecomes symbol 'PRUSAE +=: 
or symbcl OR 

and symbol "AND & 
differs frcem symbcl 'NE n= # 

is less than symkcl a se 

is at most symbol 'LE <= 

is at least symrol "GE >= 

is greater than symbol 'GH >? 
divided Ly symbol DLV: 

over symkcl "OVER % 
times symbol Lor uron 
add symbol "PLUS + 
subtract symkcl Pu NU ses 
module symbol 'MOD &: 

th element of symbcl "ELEM 

lower bound of symbol "IWB 

upper ktound of symbol "UPB 

lower state of symkol "LWS 

upper state of symbol ‘UES 

entier symkol "ENTIER 

net symtrol "NOT 

down symbol "DCWN 

up symbcl 02 
integral to sema symbol "LEVEL 

sema to integral symbol "LEVEL 

plus i times symrol 1 

power symkol "POW ** 
shift left symkol Piohy ae 

shift right symrol "SHE 
absolute value of symbol "ABS 

bkinal symkol ‘BIN 
representation of symbol "REPR 
lengthen symkol ‘LENG 
shorten symkol 'SHCRTEN 
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odd symrcl YODD 
Sign symbol 'SIGN 
round symkol "RCUND 
real part of symtrol URE 
imaginary fart of symbol Le 
conjugate of symbol "CCNJ 
argument of symkol "ARG 
boolean to bits symbol "PACK 
characters to kEytes symbol PACK 
equality symbcl 'EFQ = 


It should ke noted that the above syntax for special- 
character-monadic- and dyadic-indicants does not permit *, 
=, Or / aS monadic-indicants, nor are they permitted to 
appear at the start of monadic-indicants. This is to prevent 
ambiguities andyor indeterminations from occurring. For 
exile ele AU “Ase ——— 
monadic: A<=B ‘could be A <= B or <A < =B°'3£ = could be 
monadic; in (/:A ..., the (/ may be an alternate-sub-symbol 
or /: may be a monadic-indicant (this is resolvable at the 
matching bkus-symbcl or close-symbol, but the indetermination 
is not desiracrle). TIMES SS bO, ana, ('DLV, | however, sare 
permitted as monadic-indicants. Note that :7=: was not given 
as an alternate representation for the is-not-symbol. This 
is because A:7=:B might be A: 7=:B where A: is a label and 
-=:B is a monadic-formula or A :-=; B where :-=: is the is- 


not-symbcl. 
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